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CARBOHYDRATE-BINDING MODULES OF A NEW FAMILY Modtaget 
FIELD OF THE INVENTION 

The present invention relates to non-catalytic carbohydrate-binding modules (CBM) belonging 
to a new family of CBM's. A CBM of the invention was found attached to a glycosyl hydrolase 
family 61 (GH61) polypeptide and was shown to have little homology with known CBM's indi- 
cating that it is the first known member of a new family of CBM's. The present invention fur- 
ther relates to CBM's preferably exhibiting binding affinity for cellulose; to a method of produc- 
ing such CBM's; and to methods for using such CBM's in the textile, detergent and cellulose 
fiber processing industries, for purification of polypeptides, immobilisation of active enzymes, 
baking, manufacturing of biofuel, modification of plant cell walls. 

BACKGROUND OF THE INVENTION 

A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence 
within a carbohydrate-active enzyme with a discrete fold having carbohydrate-binding activity. 
The requirement of CBM's existing as modules within larger enzymes sets this class of car- 
bohydrate-binding protein apart from other non-catalytic sugar binding proteins such as 
lectins and sugar transport proteins. 

CBM's were previously classified as cellulose-binding domains (CBD's) based on the initial 
discovery of several modules that bound cellulose (Tomme et al. (1989) FEBS Lett 243, 239- 
243; Gilkes et al. (1988) J. Biol. Chem. 263, 10401-10407). However, additional modules in 
carbohydrate-active enzymes are continually being found that bind carbohydrates other than 
cellulose yet otherwise meet the CBM criteria. 

Previous classification of cellulose-binding domains was based on amino acid similarity. 
Groupings of CBD's were called "Types" and numbered with roman numerals (e.g. Type I or 
Type II CBD's). In keeping with the glycoside hydrolase classification, these groupings are 
now called families and numbered with Arabic numerals. Families 1 to 13 are the same as 
Types I to XIII (Tomme et al. (1995) in Enzymatic Degradation of Insoluble Polysaccharides 
(Saddler & Penner eds.) 142-163, American Chemical Society, Washington). 
Presently the known CBM's are classified in families 1-6 and 8-33. Most classified CBM's 
are of bacterial origin, and the known fungal carbohydrate-binding modules are mainly 
classified in the family CBM1. However, representatives of fungal CBM's are also found in 
CBM13, CBM18, CBM19, CBM20 and CBM24. Until now, only the fungal carbohydrate- 
binding modules from CBM1 were known to bind to crystalline cellulose. The fungal CBM's 
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from families CBM13, CBM18, CBM19, CBM20 and CBM24 have been shown to bind to 
substrates such as chitin, starch and mutan. However, also the fungal carbohydrate- 
binding modules from CBM1 bind very well to chitin. 

5 A number of fungal cellulases has been shown to contain a CBD of family CBM1 consist- 
ing of 36 amino acid residues. Examples of enzymes known to contain such a domain are: 

- Endoglucanase I (gene eg!1) from Trichoderma reeseL 

- Endoglucanase II (gene egl2) from Trichoderma reeseL 
10 - Endoglucanase V (gene eg!5) from Trichoderma reeseL 

- Exocellobiohydrolase I (gene CBHI) from Humicola grisea, Neurospora crassa, 
Phanerochaete chrysosporium, Trichoderma reesei, and Trichoderma viride. 

- Exocellobiohydrolase II (gene CBHII) from Trichoderma reeseL 

- Exocellobiohydrolase 3 (gene ce!3) from Agaricus bisporus. 
15 - Endoglucanases B, C2, F and K from Fusarium oxysporum. 

The CBD domain is found either at the N-terminal (Cbh-ll or egl2) or at the C-terminai ex- 
tremity (Cbh-I, egll or egl5) of these enzymes. There are four conserved cysteine residues 
in this type of CBD domain, all of which are involved in disulfide bonds. (Prosite, Swiss In- 
2 0 stitute of Bioinformatics). 

A DNA sequence encoding a CBD from a given organism can be obtained conventionally 
by using PCR techniques, and, also based on current knowledge; it is possible to find ho- 
mologous sequences from other organisms. 
25 It is contemplated that new CBD's can be found by cloning cellulases, xylanases or other 
plant cell wall degrading enzymes and measure the binding to e.g. cellulose. If the enzyme 
activity is bound to Avicel under the standard conditions described below, it can be as- 
sumed that part of the gene codes for a binding domain. 

30. Examples of CBM-like polypeptides obtainable from plants are expansins. Expansins are 
not CBM's per se because they are not found encoded in the same amino acid sequence 
with an enzyme activity. However, it has been observed that isolated CBM domains can 
have expansin like activity on cellulose (Levy and Shoseyov, 2002 supra). Din et al. 
(Bio/Technology 9 (1991) 1096-1099) has reported that the CBD CenAfrom Cellulomonas 

35 fimi endoglucanase A is capable of nonhydrolytic disruption activity of cellulose fibers re- 
sulting in small particle release. Furthermore, it was shown that CBD CenA could prevent 
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the floccuiation of microcrystalline bacterial cellulose (Gilkes et al. (1993) Int. J. Biol. Mac- 
romol. 15:347-351). Similar phenomena were observed for other CBD's (Krull et al. (1988) 
Biotechnol. Bioeng. 31:321-327; Banka et al. (1998) World J. Microbiol. Biotechnol. 
14:551-558; Gao et al. (2001) Acta Biochim. Biophys. Sin. 33:13-18). 

CBM's are known to be used in applications as diverse as washing, treatment of textile, re- 
moval of dental plaque, purification of polypeptides, immobilisation of active enzymes, modi- 
fication of cellulosic material, baking, manufacturing of biofuel, modification of plant cell walls. 

SUMMARY OF THE INVENTION 

The inventors have now found a carbohydrate-binding module (CBM) obtainable from the 
fungus Pseudoplectania nigrella (deposited under No. CBS 444.97) having binding affinity for 
cellulose, which novel CBM was found bound to an enzyme belonging to family 61 of the gly- 
cosyl hydrolases (GH61). The novel CBM (called CBMX) was shown to have affinity for 
Avicel® and had no observable homology (below 20%) to known CBM's. Also, none of the 
positions of the cysteine residues found on the CBM of the invention correspond to the posi- 
tions of the well conserved cysteine residues in the family CBM1 described above. This indi- 
cates that the CBM of the invention is the first known member of a new family of CBM's. 

0 Apart from the fungal CBM's of family CBM1 which have binding affinity for cellulose, the 
CBM of the invention is the first known fungal CBM shown to have binding affinity for cellu- 
lose. 

The inventors have succeeded in cloning and expressing a CBM bound to a family GH61 
enzyme. In addition, the inventors have expressed the domain only, without the GH61 en- 
5 zyme and demonstrated that the CBM alone can bind cellulose, such as Avicel. 

Said CBM domain is encoded by the DNA sequence of positions 109-531 of SEQ ID NO:1 
and has the amino acid sequence of positions 34-174 of SEQ ID NO:2. Positions 1-33 of 
SEQ ID NO:2 constitutes a signal peptide and an N-terminal region of the GH61 enzyme. 

0. Accordingly, the present invention relates to a CBM of a new family of CBM's which CBM is 
(a) a polypeptide encoded by the DNA sequence of positions 109-531 of SEQ ID NO:1. or 
a DNA sequence homologous to SEQ ID NO:1, which DNA sequence has at least 40% 
identity with positions 109-531 of SEQ ID NO:1, preferably at least 50% identity, more 
preferably at least 60% identity, more preferably at least 70% identity, more preferably at 

5 least 80%. more preferably at least 85%, more preferably at least 90%, more preferably at 
least 9 5% i dentity, m ore p referably a 1 1 east 9 7% i dentity, m ore p referably a t least 9 8% 
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identity, even more preferably at least 99% identity with positions 109-531 of SEQ ID 
NO:1; 

(b) a polypeptide produced by culturing a cell comprising the DNA sequence of positions 
109-531 of SEQ ID NO:1 under conditions wherein the DNA sequence is expressed; 

(c) a polypeptide having the amino acid sequence of positions 34-174 of SEQ ID NO:2 , or 
a polypeptide homologous to SEQ ID NO:2, which polypeptide has an amino acid se- 
quence of at least 40% identity with positions 34-174 of SEQ ID NO:2. preferably at least 
50% identity, more preferably at least 60% identity, more preferably at least 70% identity, 
more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, 
more preferably at least 95% identity, more preferably at least 97% identity, more prefera- 
bly at least 98% identity, even more preferably at least 99% identity with positions 34-174 
of SEQ ID NO:2; 

(d) a polypeptide encoded by a DNA sequence that hybridizes to the DNA sequence of 
positions 109-531 of SEQ ID NO:1 preferably under low stringency conditions, more pref- 
erably at least under medium stringency conditions, more preferably at least under me- 
dium/high stringency conditions, more preferably at least under high stringency conditions, 
even more preferably at least under very high stringency conditions; 

(e) a p olypeptide e ncoded b y a n i solated p olynucleotide m olecule which p olynucleotide 
molecule hybridizes to a denatured double-stranded DNA probe preferably under low 
stringency conditions, more preferably at least under medium stringency conditions, more 
preferably at least under medium/high stringency conditions, more preferably at least un- 
der high stringency conditions, even more preferably at least under very high stringency 
conditions, wherein the probe is selected from the group consisting of DNA probes com- 
prising the sequence shown in positions 109-531 of SEQ ID NO:1. and DNA probes com- 
prising a subsequence of positions 109-531 of SEQ ID NO:1, the subsequence having a 
length of at least about 100 base pairs, preferably at least 200 base pairs, more preferably 
at least 300 base pairs, more preferably at least 400 base pairs, more preferably at least 
440 base pairs, even more preferably a length of at least 450 base pairs, 

(f) a CBM polypeptide encoded by a DNA sequence obtainable from Pseudoplectania nigrelta 
CBS 444.97. 

In further aspects, the invention provides an expression vector comprising a DNA segment 
which is e.g. a polynucleotide molecule of the invention; a cell comprising the DNA segment 
or the expression vector; and a method of producing a CBM polypeptide, which method com- 
prises culturing the cell under conditions permitting the production of the CBM, and recover- 
ing the CBM from the culture. Further, impurities, such as homologous impurities can be re- 
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moved from the recovered CBM by use of purification methods generally known in the art 
In yet another aspect the invention provides an isolated CBM polypeptide characterized in (i) 
being free from homologous impurities and (ii) being produced by the method described 
above. 

The novel CBM of the present invention is useful for washing, treatment of textile, purification 
of polypeptides, immobilisation of active enzymes, modification of cellulosic material, baking, 
manufacturing of biofuel, modification of plant cell walls. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to non-catalytic carbohydrate-binding modules (CBM) obtain- 
able from the fungus Pseudoplectania nigrella and belonging to a new family of fungal CBM's. 
The CBM of the invention was found in association with a protein belonging to family 61 of the 
glycosyl hydrolases. The CBM of the invention is encoded by the DNA sequence of posi- 
tions 109-531 of SEQ ID NO:1 and has the amino acid sequence of positions 34-174 of 
SEQ ID NO:2. Said CBM preferably exhibits binding affinity for cellulose. The present inven- 
tion relates to a method of producing such CBM's; and to methods for using such CBM's in 
washing applications, for treatment of textile, purification of polypeptides, immobilisation of 
active enzymes, modification of cellulosic material, baking, manufacturing of biofuel, modifica- 
tion of plant cell walls. 

The inventors have succeeded in cloning and expressing a CBM bound to a family GH61 
enzyme. In addition, the inventors have expressed the domain only, without the GH61 en- 
zyme and demonstrated that the CBM alone can bind cellulose. Accordingly, the invention 
relates to a CBM of a new family of CBM's which CBM is 

(a) a polypeptide encoded by the DNA sequence of positions 109-531 of SEQ ID NO:1, or 
a DNA sequence homologous to SEQ ID NO:1, which DNA sequence has at least 40% 
identity with positions 109-531 of SEQ ID NO:1, preferably at least 50% identity, more 
preferably at least 60% identity, more preferably at least 70% identity, more preferably at 
least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at 
least 9 5% i dentity, m ore p referably a 1 1 east 9 7% i dentity, m ore p referably a t least 9 8% 
identity, even more preferably at least 99% identity with positions 109-531 of SEQ ID 
NO:1; 

(b) a polypeptide produced by culturing a cell comprising the DNA sequence of positions 
109-531 of SEQ ID NO:1 under conditions wherein the DNA sequence is expressed; 
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(c) a polypeptide having the amino acid sequence of positions 34-174 of SEQ ID NO:2 , or 
a polypeptide homologous to SEQ ID NO:2, which polypeptide has an amino acid se- 
quence of at least 40% identity with positions 34-174 of SEQ ID NO:2, preferably at least 
50% identity, more preferably at least 60% identity, more preferably at least 70% identity, 

5 more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, 
more preferably at least 95% identity, more preferably at least 97% identity, more prefera- 
bly at least 98% identity, even more preferably at least 99% identity with positions 34-174 
of SEQ ID NO:2; 

(d) a polypeptide encoded by a DNA sequence that hybridizes to the DNA sequence of 
10 positions 109-531 of SEQ ID NO:1 preferably under low stringency conditions, more pref- 
erably at least under medium stringency conditions, more preferably at least under me- 
dium/high stringency conditions, more preferably at least under high stringency conditions, 
even more preferably at least under very high stringency conditions; 

(e) a polypeptide encoded by an isolated polynucleotide molecule which polynucleotide 
15 molecule hybridizes to a denatured double-stranded DNA probe preferably under low 

stringency conditions, more preferably at least under medium stringency conditions, more 
preferably at least under medium/high stringency conditions, more preferably at least un- 
der high stringency conditions, even more preferably at least under very high stringency 
conditions, wherein the probe is selected from the group consisting of DNA probes com- 

20 prising the sequence shown in positions 109-531 of SEQ ID NO:1, and DNA probes com- 
prising a subsequence of positions 109-531 of SEQ ID NO:1, the subsequence having a 
length of at least about 100 base pairs, preferably at least 200 base pairs, more preferably 
at least 300 base pairs, more preferably at least 400 base pairs, more preferably at least 
440 base pairs, even more preferably a length of at least 450 base pairs, 

25 (f) a CBM polypeptide encoded by a DNA sequence obtainable from Pseudoplectania nigr&lla 
CBS 444.97. 

Hybridization 

Suitable experimental conditions for determining hybridization at low to very high strin- 
3 0 gency b etween a n ucleotide p robe a nd a h omologous DNAorRNAs equence i nvolves 
presoaking of the filter containing the DNA fragments or RNA to hybridize in 5 x SSC (So- 
dium chloride/Sodium citrate as described in Sambrook et a). (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Lab., Cold Spring Harbor, NY) for 10 min, and pre- 
hybridization of the fitter in a solution of 5 x SSC, 5 x Denhardt's solution (Sambrook et al. 
35 1989 supra), 0.5 % SDS and 100 pg/ml of denatured sonicated salmon sperm DNA (Sam- 
brook et al. 1989 supra), followed by hybridization in the same solution containing a con- 
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centration of 10ng/ml of a random-primed (Feinberg and Vogelstein (1983) Anal. Biochem. 
132, 6-13), 32 P-dCTP-tabeled (specific activity > 1 x 10 9 cpm/pg) probe for 12 hours at ca. 
45°C. The filter is then washed twice for 30 minutes in 2 x SSC, 0.5 % SDS at at least 
55°C (low stringency), more preferably at least 60°C (medium stringency), still more pref- 
5 erably at least 65°C (medium/high stringency), even more preferably at least 70°C (high 
stringency), and even more preferably at least 75°C (very high stringency). 

Sequence alignment and identity 

Nucleotide sequences may be aligned with the AlignX application of the Vector NTI Pro- 
10 gram Suite 7.0 (Informax, a subsidiary of Invitrogen Inc.) using the default settings, which 
employ a modified ClustalW algorithm (Thompson et al. (1994) Nuc. Acid Res. 22:4673- 
4680), the swgapdnarnt score matrix, a gap opening penalty of 15 and a gap extension 
penalty of 6.66. 

Amino acid sequences may be aligned with the AlignX application of the Vector NTI Pro- 
15 gram Suite v8 (Informax, a subsidiary of Invitrogen Inc) using default settings, which em- 
ploy a modified ClustalW algorithm (Thompson, et al. (1994) supra), the blosum62mt2 
score matrix, a gap opening penalty of 10 and a gap extension penalty of 0.1 . 

In a Smith-Waterman search (Smith and Waterman (1981) J. Mol. Biol. 147:195-197), 

2 0 generally considered to be very sensitive, two proteins registered some similarity at the amino 

acid level. The first was FIG2 of Saccharomyces cerevisiae (Swiss-Prot No. p25653). The 
Smith-Waterman score was 162, and showed 28.7% identity over a 143 base pair overlap. 
Explanation of homology: Both the CBM of the invention and FIG2 are highly serine-threonine 
rich. The region of similarity between the two proteins is thought to be highly glycosylated in 
25 FIG2. It could be that the pattern similarities in this region have more to do with glycosylation 
recognition than actual functional similarities. The second protein showing s ome s imilarity 
was a hypothetical protein from A rthrobacter n icotinovorans ( SPTREMBL:Q8GAM3). The 
protein has a Smith-Waterman score of 138 and was 30.5% identical in a 128 amino acid 
overlap. Homology is rather low overall. It is doubtful that these two proteins are either evolu- 

3 0 tionarily or functionally related. 

Size and 3D structure 

The CBM of the invention has six phenylalanine repeats at a spacing that would potentially 
put them on the same surface of a higher order structure such as a beta barrel or alpha 
35 helix. The three dimensional structures of representative members of CBM families 1-6, 9 
and 15 have been resolved by x-ray crystallography and NMR and according to Levy and 
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Shoseyov (Biotechnology Advances 20 (2002) 191-213), data from these structures indi- 
cate that CBD's from different families are structurally similar and that their cellulose bind- 
ing capacity can be attributed, at least in part, to several aromatic amino acids that com- 
pose their hydrophobic surface. The present inventors therefore wish to point out several 
5 phenylalanine residues and their significance to the ability of the CBM of the invention to 
bind cellulose. Below are subregions of the CBM with the residues marked: 

VPNFTATDVPTFTATDIPTFTATDVPIFTKKPQQPS (positions 64-99 of SEQ ID NO:2), 
and farther towards the C-terminus SVSFVAKPSAFIPKPSA (positions 110-126 of SEQ ID 

10 NO:2). 

The expressed CBM or CBM-containing polypeptide of the invention has a molecular 
weight (Mw) which is equal to or higher than about 15 kD in an unglycosylated form. The 
majority of the protein binding to Avicel appeared as a broad band of molecular weight 35- 

15 45 kDa, which is considerably higher than the 15 kDa of the protein part of the carbohy- 
drate binding module. The high and heterogeneous molecular weight is probably due to 
heterogeneity in O- and N-glycosylation of the N-terminal part of the protein. For heterolo- 
gously expressed CBMX in Aspergillus oryzae the size of CBMX can vary from 14 kDa to 
almost 70 kDa due to heterologous glycosylate of the protein. Moreover, N-terminal se- 

20 quencing of the 35-45 kDa band gave exclusively the sequence SFSSSGT (positions 47- 
53 of SEQ ID NO:9) indicating that heterogeneity in the N-terminal amino acid sequence is 
not present 

Preferably, the molecular weight of the CBM of the invention in an unglycosylated form is 
equal to or below about 70 kD, more preferably equal to or below 50 kD, more preferably 
2 5 equal to or below about 40 kD or 30 kD, even more preferably equal to or below about 25 
kD, even more preferably equal to or below about 20 kD, even more preferably equal to or 
below about 15 kD. 

Carbohydrate-binding modules 
30 Although a number of types of carbohydrate-binding modules have been described in the 
patent and scientific literature, the majority thereof, many of which derive from cellulolytic 
enzymes, are commonly referred to as cellulose-binding domains (CBD); a typical CBM 
will thus be one which occurs in a cellulase and which binds preferentially to cellulose 
and/or to poly- or oligosaccharide fragments thereof. 

35 

Cellulose-binding (and other carbohydrate-binding) modules are polypeptide amino acid 
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sequences which occur as integral parts of large polypeptides or proteins consisting of two 
or more polypeptide amino acid sequence regions, especially in hydrolytic enzymes 
(hydrolases) which typically comprise a catalytic domain containing the active site for 
substrate hydrolysis and a carbohydrate-binding domain for binding to the- carbohydrate 
5 substrate in question. Such enzymes can comprise more than one catalytic domain and 
one, two or three carbohydrate-binding domains, and they may further comprise one or 
more polypeptide amino add sequence regions linking the carbohydrate-binding domain(s) 
with the catalytic domain(s), a region of the latter type usually being denoted a "linker. 

10 In the protein complex, typically a hydrolytic enzyme, a CBM is located either at the N or C 
terminal or is internal. A monomeric CBM typically consists of more than about 30 and less 
than about 250 amino acid residues. For example, a CBM classified in Family I consists of 
33-37 amino acid residues; a CBM classified in Family 11a consists of 95-108 amino acid 
residues; and a CBM classified in Family VI consists of 85-92 amino acid residues. 

15 Accordingly, the molecular weight of a monomeric CBM will typically be in the range of 
from about 4kD to about 40kD, and usually below about 35kD. 

CBM's may be useful as a single domain polypeptide or as a dimer, a trimer, or a polymer, 
or as a part of a protein hybrid. 

20 Examples of hydrolytic enzymes comprising a carbohydrate-binding module are cellulases, 
xylanases, mannanases, arabinofuranosidases, acetylesterases, amylases, glucoamy- 
lases, mutanases and chitinases. CBM's have been shown to bind to carbohydrates such 
as cellulose, xylan, starch, chitin, mannan, beta-glucans, mutan and cyclodextrins. CBM's 
have been found in plants and algae, e.g. in the red alga Porphyra purpurea in the form of 

25 a non-hydrolytic polysaccharide-binding protein (see Tomme et at. (1996) Cellulose- 
Binding Domains, Classification and Properties in Enzymatic Degradation of Insoluble 
Carbohydrates, Saddler & Penner (Eds.), ACS Symposium Series, No. 618). 

Washing 

30 The present invention thus relates, inter a//a, to a process for removal or bleaching of soiling 
or stains present on cellulosic fabric or textile, wherein the fabric or textile is contacted in 
aqueous medium with a modified enzyme (enzyme hybrid) which comprises a catalytically 
(enzymatically) active amino acid sequence of a non-cell ulolytic enzyme linked to an amino 
acid sequence comprising a carbohydrate-binding module, such as a CBD. 

35 



Stains 



10499.000-DK 



10 

Soiling or stains which may be removed according to the present invention include those al- 
ready mentioned above, i.e. soiling or stains originating from, for example, starch, proteins, 
fats, red wine, fruit (such as blackcurrant, cherry, strawberry or tomato, in particular tomato in 
ketchup or spaghetti sauce), vegetables (such as carrot or beetroot), tea, coffee, spices (such 
5 as curry or paprika), body fluids, grass, or ink (e.g. from ball-point pens or fountain pens). 
Other types of soiling or stains which are appropriate targets for removal or bleaching in ac- 
cordance with the invention include sebum, soil (i.e. earth), clay, oil and paint A process for 
removal or bleaching of soiling or stains present on celiulosic fabric is described in WO 
97/28243. The process comprises contacting a fabric with an aqueous medium comprising a 
10 modified enzyme, which enzyme is a catalyticaliy active amino acid sequence of a non- 
celtulolytic enzyme which is linked to an amino acid sequence comprising a cellulose- 
binding domain. 

It is an object of the present invention to use the CBM of SEQ ID NO:2 in a process for 
removal or bleaching of soiling or stains present on celiulosic fabric as described in WO 
15 97/28243. 

Celiulosic fabric 

The term "celiulosic fabric" is intended to indicate any type of fabric, in particular woven fabric, 
prepared from a cellulose-containing material, such as cotton, or from a cellulose-derived ma- 
2 0 teriaf (prepared, e.g., from wood pulp or from cotton). 

In the present context, the term "fabric" is intended to include garments and other types of 
processed fabrics, and is used interchangeably with the term "textile' 7 . 

2 5 Examples of celiulosic fabric manufactured from naturally occurring celiulosic fibre are cotton, 

ramie, jute and flax (linen) fabrics. Examples of celiulosic fabrics made from man-made celiu- 
losic fibre are viscose (rayon) and lyocell (e.g. Tencel™) fabric; also of relevance in the con- 
text of the invention are all blends of celiulosic fibres (such as viscose, lyocell, cotton, ramie, 
jute or flax) with other fibres, e.g. with animal hair fibres such as wool, alpaca or camel hair, or 

3 0 with polymer fibres such as polyester, polyacrylic, polyamide or polyacetate fibres. 

Specific examples of blended celiulosic fabric are viscose/cotton blends, lyocell/cotton blends 
(e.g. Tencel™/cotton blends), viscose/wool blends, lyocell/wool blends, cotton/wool blends, 
cotton/polyester blends, viscose/cotton/polyester blends, wool/cotton/polyester blends, and 
3 5 flax/cotton blends. 
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Enzyme hybrids 

Enzyme classification numbers (EC numbers) referred to in the present patent application are 
in accordance with the Recommendations (1992) of the Nomenclature Committee of the 
International Union of Biochemistry and Molecular Biology, Academic Press Inc., 1992. 

5 

A modified enzyme (enzyme hybrid) for use in accordance with the invention comprises an 
enzymatically active amino acid sequence of a non-cellulolytic enzyme (i.e. a catalytically ac- 
tive amino acid sequence of an enzyme other than a cellulase) useful in relation to the clean- 
ing of fabric or textile, typically the removal or bleaching of soiling or stains from fabrics or tex- 

10 tiles in washing processes. Preferred are enzymes selected from the group consisting of 
amylases (e.g. a-amylases, EC 3.2.1.1), proteases (i.e. peptidases, EC 3.4), lipases (e.g. tri- 
acylglycerol lipases, EC 3.1.1.3) and oxidoreductases (e.g. peroxidases, EC 1.11.1, such as 
those classified under EC 1.11.1.7; or phenol-oxidizing oxidases, such as laccases, EC 
1.10.3.2, or other enzymes classified under EC 1.10.3), fused (linked) to an amino acid se- 

15 quence comprising a cellulose-binding module. The catalytically active amino acid sequence 
in question may comprise or consist of the whole, or substantially the whole of the full amino 
acid sequence of the mature enzyme in question, or it may consist of a portion of the full se- 
quence which retains substantially the same enzymatic properties as the full sequence. 

20 Modified enzymes of the type in question, as well as detailed descriptions of the preparation 
and purification thereof, are known in the art (see, e.g., WO 90/00609, WO 94/24158 and WO 
95/16782). They may be prepared by transforming into a host cell a DNA construct 
comprising at least a fragment of DNA encoding the CBM ligated, with or without a linker, to a 
DNA sequence encoding the enzyme of interest, and growing the transformed host cell to 

25 express the fused gene. One relevant, but non-limiting, type of recombinant product (enzyme 
hybrid) obtainable in this manner, often referred to in the art as a "fusion protein", may be 
described by one of the following general formulae: 

A-CBM-MR-X-B 

30 

A-X-MR-CBM-B 

In the latter formulae, CBM is an amino acid sequence comprising at least the carbohydrate- 
binding module (CBM) perse. 

35 



MR (the middle region; a linker) may be a bond, or a linking group comprising from 1 to about 
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100 amino acid residues, in particular of from 2 to 40 amino acid residues, e.g. from 2 to 15 
amino acid residues. MR may, in principle, alternatively be a non-amino-acid linker. 

X is an amino acid sequence comprising the above-mentioned, enzymatically active 
5 sequence of amino acid residues of a polypeptide encoded by a DNA sequence encoding the 
non-cellulolytic enzyme of interest. 

The moieties A and B are independently optional. When present, a moiety A or B constitutes 
a terminal extension of a CBM or X moiety, and normally comprises one or more amino acid 
10 residues. 

It will thus, inter alia, be apparent from the above that a CBM in an enzyme hybrid of the type 
in question may be positioned Oterminally, N-terminally or internally in the enzyme hybrid. 
Correspondingly, an X moiety in an enzyme hybrid of the type in question may be positioned 
1 5 N-terminally, Oterminally or internally in the enzyme hybrid. 

Enzyme hybrids of interest in the context of the invention include enzyme hybrids which com- 
prise more than one CBM, e.g. such that two or more CBM's are linked directly to each other, 
or are separated from one another by means of spacer or linker sequences, consisting typi- 
2 0 cally of a sequence of amino acid residues of appropriate length. Two CBM's in an enzyme 
hybrid of the type in question may, for example, also be separated from one another by 
means of an -MR-X- moiety as defined above. 

A very important issue in the construction of enzyme hybrids of the type in question is the 
25 stability towards proteolytic degradation. Two- and multi-domain proteins are particularly sus- 
ceptible towards proteolytic cleavage of linker regions connecting the domains. Proteases 
causing such cleavage may, for example, be subtilisins, which are known to often exhibit 
broad substrate specificities (see, e.g.: Gran et aL (1992) Biochemistry 31:6011-6018; Te- 
plyakov et al. (1992) Protein Engineering 5:41 3-420). 

30 

Glycosylation of linker residues in eukaryotes is one of Nature's ways of preventing prote- 
olytic degradation. Another is to employ amino acids which are less favoured by the surround- 
ing proteases. The length of the linker also plays a role in relation to accessibility by prote- 
ases. Which "solution" is optimal depends on the environment in which the enzyme hybrid is 
35 to function. When constructing new enzyme hybrid molecules, linker stability thus becomes 
an issue of great importance. 
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Cellulases (cellulase genes) useful for preparation of CBM's 

Techniques suitable for isolating a cellulase gene are well known in the art In the present 
context, the terms "cellulase" and "cellulolytic enzyme" refer to an enzyme which catalyses 
5 the degradation of cellulose to glucose, cellobiose, triose and/or other cello-oligosaccharides. 

Preferred cellulases (i.e. cellulases comprising preferred CBM's) in the present context are 
microbial cellulases, particularly bacterial or fungal cellulases. Endoglucanases, notably 
endo-1,4-p-<3lucanases (EC 3.2.1.4), particularly mono-component (recombinant) endo-1,4-(3- 

l o glucanases, are a preferred class of cellulases. 

Useful examples of bacterial cellulases are cellulases derived from or producible by bacteria 
from the group consisting of Pseudomonas, Bacillus, Celiutomonas, Clostridium, Microspore, 
Thermotoga, Caldocellum and Actinomycetes such as Streptomyces, Tenvomonospora and 
Acidothemus, in particular from the group consisting of Pseudomonas cellulolyticus, Bacillus 

15 lautus, Celiutomonas fimi, Clostridium thermoceltum, Microspora bispora, Termomonospora 
fusca, Termomonospora cellulolyticum and Acidothemus cellulolyticus. 

The c ellulase m ay b e a n a cid, a n eutral o r a n alkaline c ellulase, i .e. e xhibiting maximum 
cellulolytic activity in the acid, neutral or alkaline range, respectively, 

20 

A useful cellulase is an acid cellulase, preferably a fungal acid cellulase, which is derived from 
or producible by fungi from the group of genera consisting of Trichoderma, Myrothecium, 
Aspergillus, Phanaerochaete, Neurospora, Neocallimastix and Botrytis. 

25 A preferred useful acid cellulase is one derived from or producible by fungi from the group of 
species consisting of Trichoderma viride, Trichoderma reesei, Trichoderma longibrachiatum, 
Myrothecium verrucaria, Aspergillus niger, Aspergillus oryzae, Phanaerochaete 
chrysosporium, Neurospora crassa, Neocallimastix partriciarum, Pseudoplectania nigrella 
and Botrytis cinerea. 

30 

Another useful cellulase is a neutral or alkaline cellulase, preferably a fungal neutral or alka- 
line cellulase, which is derived from or producible by fungi from the group of genera con- 
sisting of Aspergillus, Penicillium, Myceliophthora, Humicola, Irpex, Fusarium, Stachybotrys, 
Scopulariopsis, Chaetomium, Mycogone, Verticillium, Myrothecium, Papulospora, Gliocla- 
3 5 dium, Cephalosporium, Pseudoplectania nigrella and Acremonium. 
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A preferred alkaline cellulase is one derived from or producible by fungi from the group of 
species consisting of Humicola insolens, Fusarium oxyspomm, Mycefiopthora thermophila, 
. Penicillium janthinellum and Cephalosporium sp., preferably from the group of species con- 
sisting of Humicola insolens DSM 1800, Fusarium oxysporum DSM 2672, Myceliopthora trier- 
5 mophila CBS 1 17.65, and Cephalosporium sp. RYM-202. 

Other examples of useful cellulases are variants of parent cellutases of fungal or bacterial 
origin, e.g. variants of a parent cellulase derivable from a strain of a species within one of the 
fungal genera Humicola, Trichoderma or Fusarium. 

10 

Amvlolvtic enzymes 

Amylases (e.g. a- or p-amylases) which are appropriate as the basis for enzyme hybrids of 
the types employed in the context of the present invention include those of bacterial or fungal 
origin. Chemically or genetically modified mutants of such amylases are included in this 
15 connection. Relevant a-amylases include, for example, a-amylases obtainable from Bacillus 
species, in particular a special strain of B. licheniformis f described in more detail in GB 
1296839. Relevant commercially available amylases include Duramyl®, Termamyl®, Fun- 
gamyl® and BAN® (all available from Novozymes A/S t Bagsvaerd, Denmark), and Rapidase™ 
and Maxamyl™ P (available from DSM, Holland). 

20 

Other useful anxiolytic enzymes are CGTases (cyclodextrin glucanotransferases, EC 
2.4.1.19), e.g. those obtainable from species of Bacillus, Thermoanaerobactor or Thermoan- 
aerobacterium. 

25 Proteolytic enzymes 

Proteases (peptidases) which are appropriate as the basis for enzyme hybrids of the types 
employed in the context of the present invention include those of animal, vegetable or micro- 
bial origin. Proteases of microbial origin are preferred. Chemically or genetically modified mu- 
„ tants of such proteases are included in this connection. The protease may be a serine prote- 

30 ase, preferably an alkaline microbial protease or a trypsin-like protease. Examples of alkaline 
proteases are subtilisins, especially those derived from Bacillus, e.g., subtllisin Novo, subtil- 
isin Cartsberg, subtilisin 309, subtilisin 147 and subtilisin 168 (described in WO 89/06279). 
Examples of trypsin-like proteases are trypsin (e.g. of porcine or bovine origin) and the Fusa- 
rium protease described in WO 89/06270. 

35 

Relevant commercially available protease enzymes include Alcalase* Savinase® and Es- 
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perase® (all available from Novozymes A/S, Bagsvaerd, Denmark), Maxatase™, Maxacal™, 
Maxapem™ and Properase™ (available from DSM, Holland), Purafect™ and Purafect™ OXP 
(available from Genencor International, USA), and Opticlean™ and Optimase™ (available 
from by Solvay Enzymes). 

5 

Lipolytic enzymes 

Lipolytic e nzymes (lipases) w hich a re a ppropriate a s t he b asis for e nzyme h ybrids of the 
types employed in the context of the present invention include those of bacterial or fungal ori- 
gin. Chemically or genetically modified mutants of such lipases are included in this connec- 
10 tion. 

Examples of useful lipases include a Humicola lanuginosa lipase, e.g. as described in EP 258 
068 and EP 305 216; a Rhizomucor miehei lipase, e.g. as described in EP 238 023; a Can- 
dida lipase, such as a C. antarctica lipase, e.g. the C. antarctica lipase A or B described in EP 

15 214 761; a Pseudomonas lipase, such as one of those described in EP 721 981 (e.g. a lipase 
obtainable from a Pseudomonas sp. SD705 strain having deposit accession number FERM 
BP-4772), in PCT/JP96/00426, in PCT/JP96/00454 (e.g. a P. solanacearum lipase), in EP 
571 982 or in WO 95/14783 (e.g. a P. mendocina lipase), a P. alcaligenes or P. pseudoalcalh 
genes lipase, e.g. as described in EP 218 272, a P. cepacia lipase, e.g. as described in EP 

20 331 376, a P stutzeri lipase, e.g. as disclosed in GB 1,372,034, or a P fluorescens lipase; a 
Bac///us lipase, e.g. a B. subtilis lipase (Dartois et al. (1993) Biochemica et Biophysica Acta 
1 131:253-260), a 0. stearothermophilus lipase (JP 64/744992) and a B. pumilus lipase (WO 
91/16422). 

25 Furthermore, a number of cloned lipases may be useful, including the Penicillium camember- 
tii lipase described by Yamaguchi et al. (1991) in Gene 103:61-67, the Geotricum candidum 
lipase (Schimada et al. (1989) J. Biochem. 106:383-388), and various Rhizopus lipases such 
as an R. delemar lipase (Hass et al. (1991) Gene 109:117-113), an R. niveus lipase 
(Kugimiya et al. (1992) Biosd. Biotech. Biochera 56:716-719) and a R. oryzae lipase. 

30 

Other potentially useful types of lipolytic enzymes include cutinases, e.g. a cutinase derived 
from Pseudomonas mendocina as described in WO 88/09367, or a cutinase derived from 
Fusanum solani f. pisi (described, e.g., in WO 90/09446). 

35 Suitable commercially available lipases include Upolase® and Upolase Ultra® (available from 
Novozymes A/S), M1 Lipase™, Lumafast™ and Lipomax™ (available from DSM, Holland) 
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and Lipase P "Amano" (available from Amano Pharmaceutical Co. Ltd.). 
Oxidoreductases 

Oxldoreductases which are appropriate as the basis for enzyme hybrids of the types em- 
5 ployed in the context of the present invention include peroxidases (EC 1.11.1) and oxidases, 
such as laccases (EC 1.10.3.2) and certain related enzymes. 

Peroxidases 

Peroxidases (EC 1.11.1) are enzymes acting on a peroxide (e.g. hydrogen peroxide) as ac- 
10 ceptor. Very suitable peroxidases are those classified under EC 1.11.1.7, or any fragment 
derived therefrom, exhibiting peroxidase activity. Synthetic or semisynthetic derivatives 
thereof (e.g. with porphyrin ring systems, or microperoxidases, cf., for example, US 
4,077,768, EP 537 381 , WO 91/05858 and WO 92/16634) may also be of value in the context 
of the invention. 

15 

Very suitable peroxidases are peroxidases obtainable from plants (e.g. horseradish 
peroxidase or soy bean peroxidase) or from microorganisms, such as fungi or bacteria. In this 
respect, some preferred fungi include strains belonging to the subdivision Deuteromycotina, 
class Hyphomycetes, e.g. Fusarium, Humicola, Tricoderma, Myrothecium, Vertictllum, 
2 0 Arthromyces, Caldariomyces. Ulocladium, Embellisia, Cladosporium or Dreschlera, in par- 
ticular Fusarium oxysporum (DSM 2672), Humicola insolens, Trichoderma resit, Myrothecium 
verrucana (IFO 6113), Verticfflum atboatrum, Verticillum dahlie f Arthromyces ramosus (FERM 
P-7754), Caldariomyces fumago, Ulocladium chartarum, Embellisia alii or Dreschlera 
halodes. 

25 

Other preferred fungi include strains belonging to the subdivision Basidiomycotina. class 
Basidiomycetes, e.g. Coprinus, Phanerochaete, Coriolus or Trametes, in particular Coprinus 
cinereusf. microsporus (IFO 8371), Coprinus macrorhizus, Phanerochaete chrysosporium 
(e.g. NA-12) or Trametes versicolor (e.g. PR4 28-A). 

30 

Further preferred fungi include strains belonging to the subdivision Zygomycotina, class 
Mycoraceae, e.g. Rhizopus or Mucor, in particular Mucor hiemalis. 

Some preferred bacteria include strains of the order Actinomycetales, e.g. Streptomyces 
35 spheroides (ATTC 23965), Streptomyces thermoviotaceus (IFO 12382) or Streptoverticillum 
verticillium ssp. verticitlium. 
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Other preferred bacteria include Bacillus pumifus (ATCC 12905), Bacillus steargthermophilus, 
Rhodobacter sphaeroides, Rhodomonas palustri, Streptococcus lactis, Pseudomonas 
purrocinia (ATCC 15958) or Pseudomonas fluorescens (NRRL B-1 1). 

5 

Further preferred bacteria include strains belonging to Myxococcus, e.g. M virescens. 

Other potential sources of useful particular peroxidases are listed in Saunders et al. (1964) 
Peroxidase, 41-43 London. 

10 

The peroxidase may furthermore be one which is producible by a method comprising 
cultivating a host cell - transformed with a recombinant DNA vector which carries a DNA 
sequence encoding said peroxidase as well as DNA sequences encoding functions 
permitting the expression of the DNA sequence encoding the peroxidase - in a culture 
15 medium under conditions permitting the expression of the peroxidase, and recovering the 
peroxidase from the culture. 

A suitable recombinant^ produced peroxidase is a peroxidase derived from a Coprinus sp., in 
particular C. macrorhizus or C. cinereus according to WO 92/16634, or a variant thereof, e.g. 
2 0 a variant as described in WO 94/1 2621 . 

Oxidases and related enzymes 

Preferred oxidases in the context of the present invention are oxidases classified under EC 
1.10.3, which are oxidases employing molecular oxygen as acceptor (i.e. enzymes catalyzing 

2 5 oxidation reactions in which molecular oxygen functions as oxidizing agent). 

As indicated above, laccases (EC 1.10.3.2) are very suitable oxidases in the context of the 
invention. Examples of other useful oxidases in the context of the invention include the 
catechol oxidases (EC 1.10.3.1) and bilirubin oxidases (EC 1.3.3.5). Further useful, related 

3 0 enzymes include monophenol monooxygenases (EC 1 . 1 4. 1 8. 1 ). 

Laccases are obtainable from a variety of plant and microbial sources, notably from bacteria 
and fungi (including filamentous fungi and yeasts), and suitable examples of laccases are to 
found among those obtainable from fungi, including laccases obtainable from strains of 
35 Aspergillus, Neurospora (e.g. N. cras$a) t Podospora, Botrytis, Collybia, Fomes, Lentinus, 
Pleurotus* Trametes (e.g. T. vtilosa or T. versicolor [some species/strains of Trametes being 
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known by various names and/or having previously been classified within other genera; e.g. 
Trametes villosa = T. pinsitus = Polyporus pinsitis (also known as P. pinsitus or P. villosus) = 
Coriolus pinsitus], Polyporus, Rhizoctonia (e.g. R so/an/), Coprinus (e.g. C. piicatilis or C. 
cinereus), Psatyrella, Myceliophthora (e.g. A/f. thermophila), Schytalidium, Phlebia (e.g. P. 
5 rad/fa; see WO 92/01046), Coriolus (e.g. C.hirsutus\ see JP 2-238885), Pyricularia or 
R/g/otopo/iis. 

Preferred laccases in the context of the invention include laccase obtainable from 
species/strains of Trametes (e.g. 7. villosa), Myceliophthora (e.g. M thermophila), 
1 0 Schytalidium or Polyporus. 

Other enzymes 

Further classes of enzymes which are appropriate as the basis for enzyme hybrids of the 
types employed in the context of the present invention include pectinases such as pectate 

15 lyase (EC 4.2.2.2), pectin lyase (EC 4.2.2.10), rhamnogalacturonan lyase (EC not defined), 
endo-1,4-galactanase (EC 3.2.1.89), xyloglucanase (EC not defined), xylanase (EC 3.2.1.8), 
arabinanase (EC 3.2.1.99), alpha-L-arabinofuranosidase (EC 3.2.1.55), Mannan endo-1 ,4- 
mannosidase (EC 3.2.1.78), beta-mannosidase ( EC 3.2.1.25), beta-1 ,3-1 ,4-glucanase ( EC 
3.2.1.73), rhamnogalacturonan hydrolase, exo-polygalacturonase (EC 3.2.1.67), rhamnoga- 

20 lacturonase (EC not defined), Cellulase (EC 3.2.1.4), Glucan 1 ,3-beta-glucosidase (EC 
3.2.1.58), Licheninase (EC 3.2.1.73), Glucan endo-1 ,6-beta-glucosidase (EC 3.2.1.75), Man- 
nan endo-1 ,4-beta-mannosidase (EC 3.2.1.78), Endo-1 ,4-beta-xyIanase (EC 3.2.1.8), Cellu- 
lose 1,4-cellobiosidase (EC 3.2.1.91), cellobiohydrolase (EC 3.2.1.91). (polygalacturonases 
(EC 3.2.1.15). Acetyl and methyl esterase enzymes such as: rhamnogalacturonan methyl es- 

2 5 terase, rhamnogalacturonan acetyl esterase, pectin methylesterase (EC 3.1.1.11), pectin 

acetylesterase (EC not defined), xylan methyl esterase, acetyl xylan esterase (EC 3.1.1.72), 
feruloyl esterase (EC 3.1.1 .73), cinnamoyl esterase (EC 3.1.1.73). 

Detergents 

3 0 The CBM of the invention may be added to a detergent for washing textile, such as a laundry 

detergent or a detergent for washing hard surfaces, such as a dish washing detergent A de- 
tergent composition comprising the CBM of the invention can further comprise one or more 
enzymes selected from the group consisting of proteases, cellulases (endo-glucanases), 
beta-glucanases, hemicellulases, lipases, peroxidases, laccases, alpha-amylases, glucoa- 
3 5 myiases, cutinases, pectinases, reductases, oxidases, phenoloxidases, ligninases, pullu- 
lanases, pectate lyases, xyloglucanases, xylanases, pectin acetyl esterases, polygalacturo- 
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nases, rhamnogalacturonases, pectin lyases, other mannanases, pectin methylesterases, 
cellobiohydrolases, transglutaminases; or mixtures thereof. 

Further, a detergent composition in accordance with the invention may contain ordinary de- 
tergent components such as for example a surfactant, a builder, a bleach, a suds suppressor 
5 as described in WO 99/27082. 

Treatment of textile 

During the weaving of textiles, the threads are exposed to considerable mechanical strain. In 
order to prevent breaking, they are usually reinforced by coating (sizing) with a gelatinous 
l o substance called "size". 

The most common sizing agent is starch in native or modified form. However, other polymeric 
substances, for example poly-vinylalcohol (PVA), polyvinylpyrrolidone (PVP), polyacrylic acid 
(PAA) or derivatives of cellulose [e.g. carboxy-methylcellulose (CMC), hydroxyethylcellulose, 
15 hydroxypropyl-ceilulose or methylcellulose] may also be abundant in the size. Small amounts 
of, e.g., fats or oils may also be added to the size as a lubricant. 

As a consequence of the presence of the size, the threads of the fabric are not able to absorb 
water, finishing agents or other compositions (e.g. bleaching, dyeing or crease-proofing 
20 compositions) to a sufficient degree. Uniform and durable finishing of the fabric can thus be 
achieved only after removal of the size from the fabric; a process of removing size for this 
purpose is known as a "desizing" process. 

In cases where the size comprises a starch, the desizing treatment may be carried out using 
25 a starch-degrading enzyme {e.g. an amylase). In cases where the size comprises fat and/or 
oil, the desizing treatment may comprise the use of a lipolytic enzyme (a lipase). In cases 
where the size comprises a significant amount of carboxymethylcellulose (CMC) or other 
cellulose-derivatives, the desizing treatment may be carried out with a cellulolytic enzyme, 
either a lone o r i n c ombination w ith o ther s ubstances, o ptionally i n c ombination w ith o ther 
3 0 enzymes, such as amylases and/or lipases. 

It is an object of the present invention to achieve improved enzyme performance under desiz- 
ing conditions by modifying the enzyme so as to alter (increase) the affinity of the enzyme for 
cellulosic fabric, whereby the modified enzyme comes into closer contact with the sizing 
3 5 agent in question. 
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The present invention thus relates, inter alia, to a process for desizing cellulosic fabric or tex- 
tile, wherein the fabric or textile is treated (normally contacted in aqueous medium) with a 
modified enzyme (enzyme hybrid) which comprises a catalytically (enzymatically) active 
amino acid sequence of an enzyme, in particular of a non-celluloiytic enzyme, linked to an 
5 amino acid sequence comprising a carbohydrate-binding module, such as the CBM of SEQ 
ID NO:2. This process is described in further detail in WO 97/28256. 

The term "desizing" is intended to be understood in a conventional manner, i.e. the removal 
of a sizing agent from the fabric. 

10 

Scouring 

The scouring process removes non-cellulosic material from the cotton fiber, especially the 
cuticle (mainly consisting of waxes) and primary cell wall (mainly consisting of pectin, pro- 
tein and xyloglucan) before bleaching and dying of the textile. A proper wax removal is 
15 necessary for obtaining a high wettability, being a measure for obtaining a good dyeing. 
Removal of the primary cell wall improves wax removal and ensures a more even dyeing. 
Further this improves the whiteness in the bleaching process. 

The CBM's of family CBM1 are known to wedge into crystalline cellulose like expansins 
and swollenins, and aid in the release of non cellulose components or contaminants from 
20 textile. The dye accessibility can be increased by treating the textile with CBM's. The 
CBM's of the invention have properties similar to the CBM's of family CBM1. Accordingly, 
the CBM's of the invention may be used to remove non-cellulosic material from the cotton 
fiber in the scouring process. 

2 5 Affinity tags 

CBM's that bind reversibly to carbohydrates are useful for separation and purification of 
target polypeptides. CBM's of family I bind reversibly to crystalline cellulose and are useful 
tags for affinity chromatography. 

It is an object of the present invention to achieve improved separation and purification of tar- 
30 " get polypeptides by use of CBM's as affinity tags as describes by Terpe (2003) Appl. Micro- 
biol. BiotechnoL 60:523-533 and US 5,670.623. 

Immobilisation of molecules 

Some CBM's bind irreversibly to cellulose and can be used for immobilization of molecules 
35 such as metallothioneins, phytochelatins or enzymes. Such an immobilization is useful in 
e.g. removal of heavy metal contaminations from the environment, wherein the heavy 



10499.000-DK 

21 

metal ions bind to polypeptides biosorbents such as m etallothioneins or p hytochelatins, 
and the CBM-biosorbent-heavy metal complex is irreversibly immobilized by the binding of 
the CBM to a carbohydrate material (Xu et al. (2002) Biomacromolecules 3:462-465. It is 
an object of the present invention to achieve immobilization of molecules such as metal- 
5 lothioneins, phytochelatins or enzymes by use of the CBM of SEQ ID NO:2 as fusion pro- 
teins. Further, a method for removal contaminants such as heavy metals from the envi- 
ronment, by immobilization with CBM's it is an object of the present invention. 

CBM conjugates 

10 A carbohydrate-binding domain conjugate, such as a CBD conjugate, comprises at least 
two CBD's attached to a polysaccharide. The polysaccharide may be capable of binding to 
cellulose, and is conveniently locust bean gum. CBD conjugates are able to increase the 
strength of cellulosic material such as fabric by cross-linking fibres a s d escribed in GB 
2376017 to Unilever, it is an object of the present invention to increase the strength and 

15 wear of the fabric by cross-linking fibres by use of the CBM of SEQ ID NO:2. 

The CBD conjugates may also be used as delivery vehicles to deposit materials on textile 
in any stage of the laundering process. This latter application can be achieved by coating 
the benefit agent, either directly by chemical means or indirectly via a compound associ- 
2 0 ated with the benefit agent e.g. a capsule as described in GB 2376017 to Unilever. Exam- 
ples of such benefit agents are softening agents, finishing agents, protecting agents, fra- 
grances such as perfumes and bleaching agents. 

Examples of softening agents are clays, cationic surfactants or silicon compounds. Exam- 
ples of finishing agents and protecting agents are polymeric lubricants, soil repelling 
25 agents, soil release agents, photo-protective agents such as sunscreens, anti-static 
agents, dye-fixing agents, anti-bacterial agents and anti-fungal agents. The fragrances or 
perfumes may be encapsulated, e.g. in latex or microcapsules or gelatin based coacer- 
vates. 

It is an object of the present invention to use of the CBM of SEQ ID NO:2 as a delivery ve- 
30 hide to deposit materials on textile in the laundry process. 

Baking 

It is known that adding CBM's to xylanases or amylases or other baking active enzymes, 
may result in better performance in baking trials than enzymes without CBM's. In W O 
35 98/16112 it is described how antistaling enzymes such as amylolytic enzymes was fused 
to a CBD and used to retard staling and aging of baked bread. It is an object of the present 
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invention to fuse the CBM of SEQ ID NO:2 with amylotytic enzymes and use it to retard 
staling and aging of baked bread as described in WO 98/161 12. 



Use of CBM's for production of bioethanol 

S Ethanol can be produced from agricultural waste or biomass (biofuel). The ethanol convertible 
components of many types of biomass (for example, corn stover, wood pulp and wheat straw) 
consists largely of crystalline cellulose. Crystalline cellulose is naturally resistant to enzymatic 
degradation because the cellulose fibrils are tightly packed together thus creating an accessi- 
bility problem for cellulose degrading enzymes. A number of methods for opening the struc- 
10 ture of crystalline cellulose in biomass are being investigated: acid pre-treatment with steam 
explosion is one well studied method (Bura et al. (2002) Appl Biochem Biotechnol. 98-100:59- 
72). Wet oxidation is another method described by Naito et al. (2001) Journal of Chemical 
Engineering of Japan, 34(12)1545-1548. 

There is clear evidence that cellulose binding domains alone can alter the characteristics of 
15 crystalline cellulose (Shoseyov et al. (2002) Proceedings of the 223th American Chemical 
Society National Meeting. Orlando, Florida, USA. It is an object of the present invention to 
use the CBM of the invention for disruption of the microcrystalline nature of the cellulose mi- 
crofibrils found in biomass for production of biofuel so as to increase accessibility of cellulose 
degrading enzymes to the biomass. 

20 

Modification of plant cell walls 

By introducing a gene encoding a CBM such as a CBD into plants or microorganisms it is 
possible to express CBD proteins within the cell wall of the microorganism or plant tissue. 
CBD proteins have been shown to bind to newly synthesized cellulose fibres in plant cell 

25 wails, and this physico-mechanical interference uncouples cellulose synthesis by the sub- 
units of the cellulose synthase enzyme complexes. This results in an increased rate of syn- 
thesis of the cellulose polymer, improved polymer qualities and enhanced biomass. The 
increased rate of cellulose synthesis in the cell wall leads to enhanced cellulose produc- 
tion, greater biomass at the plant level, improved fibre properties and may enhance resis- 

30 tance to biotic and abiotic stress. A CBD encoding gene can be inserted into hardwood 
forestry species and subsequent substantial volume increases with improvements in wood 
density and fibre properties can be demonstrated. These improvements will carry through 
to the finished paper, exhibiting enhanced tensile, tear and burst indices (US 6,184,440). 
It is an object of the present invention to insert the CBM encoding DNA sequence of SEQ ID 

3 5 NO: 1 into a plant in order to alter the cell walls of said plant, resulting in enhanced growth and 
biomass, increased cellulose production, improved fibre properties, improved digestibility by 
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livestock, and increased yield properties as described in US 6,184,440. 
CBM composition 

When used for the applications described above, the CBM of the invention may be part of a 
5 composition made for the specific application. Further components in such compositions 
comprise a carrier compound, and one or more enzymes selected from the group consisting 
of proteases, cellulases, beta-glucanases, hemicellulases, lipases, peroxidases, laccases, 
alpha-amylases, glucoamylases, cutinases, pectinases, reductases, oxidases, phenoloxi- 
dases, ligninases, pullulanases, pectate lyases, xyloglucanases, xylanases, pectin acetyl es- 
10 terases, polygalacturonases, rhamnogalacturonases, pectin lyases, other mannanases, pec- 
tin methylesterases, cellobiohydrolases, transglutaminases; or mixtures thereof. 

Expression of the CBM of the Invention 

Nucleic acid constructs comprising nucleotide sequences 
15 The present invention relates to nucleic acid constructs comprising a nucleotide sequence 
of the invention operably linked to one or more control sequences that direct the expres- 
sion of the coding sequence in a suitable host cell under conditions compatible with the 
control sequences. 

A nucleotide sequence encoding a CBM of the invention may be manipulated in a variety 

2 0 of ways to provide for expression of the CBM. Manipulation of the nucleotide sequence 

prior to its insertion into a vector may be desirable or necessary depending on the expres- 
sion vector. The techniques for modifying nucleotide sequences utilizing recombinant DNA 
methods are well known in the art. 

The control sequence may be an appropriate promoter sequence, a nucleotide sequence 
25 which is recognized by a host cell for expression of the nucleotide sequence. The promoter 
sequence contains transcriptional control sequences, which mediate the expression of the 
polypeptide. The promoter may be any nucleotide sequence which shows transcriptional 
activity in the host cell of choice including mutant, truncated, and hybrid promoters, and 
may be obtained from genes encoding extracellular or intracellular polypeptides either ho- 

3 0 mologous or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of the nucleic acid constructs 
of the present invention, especially in a bacterial host cell, are the promoters obtained from 
the E. coli lac operon, Streptomyces coelicoior agarase gene (dagA), Bacillus subtitis le- 
3 5 vansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene {amyL), Bacillus 
stearqthermophilus maltogenic amylase gene (amy/W), Bacillus amyloliquefaciens alpha- 
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amylase gene {arnyQ), Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis 
xylA and xylB genes, and prokaryotic beta-lactamase gene (Vilia-Kamaroff et al. (1978) 
Proceedings of the National Academy of Sciences USA 75:3727-3731), as well as the tac 
promoter (DeBoer et al. (1983) Proceedings of the National Academy of Sciences USA 
5 80:21-25). Further promoters are described in "Useful proteins from recombinant bacteria" 
in Scientific American (1980) 242:74-94; and in Sambrook et al. (1989) supra. 

Examples of suitable promoters for directing the transcription of the nucleic acid constructs 
of the present invention in a filamentous fungal host cell are promoters obtained from the 

10 genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, As- 
pergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Asper- 
gillus niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei I ipase, As- 
pergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Asper- 
gillus nidulans acetamidase, and Fusarium oxysporum trypsin-Jike protease (WO 

15 96/00787), as well as the NA2-tpi promoter (a hybrid of the promoters from the genes for 
Aspergillus niger neutral alpha-amylase and Aspergillus oryzae triose phosphate isom- 
erase), and mutant, truncated, and hybrid promoters thereof. 

In a yeast host, useful promoters are obtained from the genes for Saccharomyces cere- 
20 visiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), Saccharomy- 
ces cerevisiae alcohol dehydrogenase/gIyceraldehyde-3-phosphate dehydrogenase 
(ADH2/GAP), and Saccharomyces cerevisiae 3-phosphoglycerate kinase. Other useful 
promoters for yeast host cells are described by Romanos et al. (1992) Yeast 8:423-488. 

25 The control sequence may also be a suitable transcription terminator sequence, a se- 
quence recognized by a host cell to terminate transcription. The terminator sequence is 
operably linked to the 3* terminus of the nucleotide sequence encoding the CBM. Any ter- 
minator which is functional in the host cell of choice may be used in the present invention. 
Preferred terminators for filamentous fungal host cells are obtained from the genes for As- 

30 pergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans an- 
thranilate synthase, Aspergillus niger alpha-glucostdase, and Fusarium oxysporum trypsin- 
like protease. 

Preferred terminators for yeast host cells are obtained from the genes for Saccharomyces 
35 cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomy- 
ces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for 
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yeast host cells are described by Romanes et al. (1992) supra. 

The control sequence may also be a suitable leader sequence, a non-translated region of 
an mRNA which is important for translation by the host cell. The leader sequence is oper- 
5 ably linked to the 5' terminus of the nucleotide sequence encoding the polypeptide. Any 
leader sequence that is functional in the host cell of choice may be used in the present in- 
vention. Preferred leadens for filamentous fungal host cells are obtained from the genes for 
Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase, 

10 Suitable leaders for yeast host cells are obtained from the genes for Saccharomyces cere- 
visiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Sac- 
charomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydro- 
genase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP). 

15 The control sequence may also be a polyadenylation sequence, a sequence operably 
linked to the 3' terminus of the nucleotide sequence and which, when transcribed, is rec- 
ognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. 
Any polyadenylation sequence which is functional in the host cell of choice may be used in 
the present invention. 

20 Preferred polyadenylation sequences for filamentous fungal host cells are obtained from 
the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergil- 
lus nidulans anthranilate synthase, Fusarium oxysporum trypsin-like protease, and Asper- 
gillus niger alpha-glucosidase. 

Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman 
25 (1995) Molecular Cellular Biology 15:5983-5990, 

The control sequence may also be a signal peptide coding region that codes for an amino 
acid sequence linked to the amino terminus of a polypeptide and directs the encoded CBM 
into the cell's secretory pathway. The 5' end of the coding sequence of the nucleotide se- 

30 quence may inherently contain a signal peptide coding region naturally linked in translation 
reading frame with the segment of the coding region which encodes the secreted CBM. 
Alternatively, the 5' end of the coding sequence may contain a signal peptide coding re- 
gion which is foreign to the coding sequence. The foreign signal peptide coding region may 
be required where the coding sequence does not naturally contain a signal peptide coding 

35 region. Alternatively, the foreign signal peptide coding region may simply replace the natu- 
ral signal peptide coding region in order to enhance secretion of the CBM. However, any 
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signal peptide coding region which directs the expressed CBM into the secretory pathway 
of a host cell of choice may be used in the present invention. The native signal peptide 
coding region of the CBM of the present invention is nucleotides 10 to 69 of SEQ ID NO:1 
encoding amino acids 1 to 20 of SEQ ID NO:2. 

5 

Effective signal peptide coding regions for bacterial host cells are the signal peptide coding 
regions o btained from the genes for Bacillus N CIB 1 1837 maltogenic amylase, Bacillus 
stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis 
beta-Iactamase, Bacillus stearothermophilus neutral proteases (nprT, nprS t nprM), and 
10 Bacillus subtilis prsA. Further signal peptides are described by Simonen and Palva (1993) 
Microbiological Reviews 57:109-137. 

Effective signal peptide coding regions for filamentous fungal host cells are the signal pep- 
tide coding regions obtained from the genes for Aspergillus oryzae TAKA amylase, Asper- 
gillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor miehei aspartic 
15 proteinase, Humicola insolens cellulase, Candida antarctica lipase and Humicola lanugi- 
nosa lipase. 

Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces 
cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal pep- 
tide coding regions are described by Romanos et al. (1992) supra. 

20 

The control sequence may also be a propeptide coding region that codes for an amino 
acid sequence positioned at the amino terminus of a CBM. The resultant polypeptide may 
be denoted a pro-CBM or propolypeptide. A propolypeptide is generally inactive and can 
be converted to a mature active polypeptide by catalytic or autocatalytic cleavage of the 
25 propeptide from the propolypeptide. The propeptide coding region may be obtained from 
the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease 
(nprT), Saccharomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase, 
and Myceliophthora thermophila laccase (WO 95/33836). 

Where both signal peptide and propeptide regions are present at the amino terminus of a 
3 0 polypeptide, the propeptide region is positioned next to the amino terminus of a polypep- 
tide and the signal peptide region is positioned next to the amino terminus of the propep- 
tide region. 

In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the TAKA 
35 alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and Aspergillus oryzae 
glucoamylase promoter may be used as regulatory sequences. Other examples of regula- 
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tory sequences are those which allow for gene amplification. In eukaryotic systems, these 
include the dihydrofolate reductase gene which is amplified in the presence of meth- 
otrexate, and the metallothionein genes which are amplified with heavy metals. In these 
cases, the nucleotide sequence encoding the polypeptide would be operably linked with 
5 the regulatory sequence. 

Recombinant expression vector comprising nucleic acid construct 

The present invention also relates to recombinant expression vectors comprising the nu- 
cleic acid construct of the i nvention. The various nucleotide and control sequences de- 

10 scribed above may be joined together to produce a recombinant expression vector, which 
may include one or more convenient restriction sites to allow for insertion or substitution of 
the nucleotide sequence encoding the polypeptide at such sites. Alternatively, the nucleo- 
tide sequence of the present invention may be expressed by inserting the nucleotide se- 
quence or a nucleic acid construct comprising the sequence into an appropriate vector for 

15 expression. In creating the expression vector, the coding sequence is located in the vector 
so that the coding sequence is operably linked with the appropriate control sequences for 
expression. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) which can 
be conveniently subjected to recombinant DNA procedures and can bring about the ex- 
20 pression of the nucleotide sequence. The choice of the vector will typically depend on the 
compatibility of the vector with the host cell into which the vector is to be introduced. The 
vectors may be linear or closed circular plasmids. 

The vector may be an autonomously replicating vector, /.e. a vector which exists as an ex- 
trachromosomal entity, the replication of which is independent of chromosomal replication, 
2 5 e.g. a plasmid, an extrachromosomat element, a minichromosome, or an artificial chromo- 
some. 

The vector may contain any means for assuring self-replication. Alternatively, the vector 
may be one which, when introduced into the host cell, is integrated into the genome and 
replicated together with the chromosome(s) into which it has been integrated. Furthermore, 
30 a single vector or plasmid or two or more vectors or plasmids which together contain the 
total DNA to be introduced into the genome of the host cell, or a transposon may be used. 

The vectors of the present invention preferably contain one or more selectable markers 
which permit easy selection of transformed cells. A selectable marker is a gene the prod- 
35 uct of which provides for biocide or viral resistance, resistance to heavy metals, prototro- 
phy to auxotrophs, and the like. 
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Examples of bacterial selectable markers are the dal genes from Bacillus subtilis or 8ac//- 
lus licheniformis, or markers which confer antibiotic resistance such as ampicillin, kanamy- 
cin, chloramphenicol or tetracycline resistance. Suitable markers for yeast host cells are 
5 ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. Selectable markers for use in a fila- 
mentous fungal host cell include, but are not limited to, amdS (acetamidase), argB (or- 
nithine carbamoyltransferase), bar (phosphinothricin acetyltransferase), hygB (hygromycin 
phosphotransferase), niaD (nitrate reductase), pyrG (orotidine-5*-phosphate decarboxy- 
lase), sC (sulfate adenyltransferase), trpC (anthranilate synthase), as well as equivalents 
10 thereof. Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergil- 
lus nidulans or Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus. 

The vectors of the present invention preferably contain an elements) that permits stable 
integration of the vector into the host cell's genome or autonomous replication of the vector 

15 in the cell independent of the genome. For integration into the host cell genome, the vector 
may rely on the nucleotide sequence encoding the polypeptide or any other element of the 
vector for stable integration of the vector into the genome by homologous or nonhomolo- 
gous recombination. Alternatively, the vector may contain additional nucleotide sequences 
for directing integration by homologous recombination into the genome of the host cell. 

2 0 The additional nucleotide sequences enable the vector to be integrated into the host cell 
genome at a precise location(s) in the chromosome(s). To increase the likelihood of inte- 
gration at a precise location, the integrational elements should preferably contain a suffi- 
cient n umber o f n ucleotides, s uch a s 1 00 1 o 1 ,500 b ase p airs, p referably 4 00 1 o 1 ,500 
base pairs, and most preferably 800 to 1,500 base pairs, which are highly homologous 

25 with the corresponding target sequence to enhance the probability of homologous recom- 
bination. The integrational elements may be any sequence that is homologous with the 
target sequence in the genome of the host cell. Furthermore, the integrational elements 
may be non-encoding or encoding nucleotide sequences. On the other hand, the vector 
may be integrated into the genome of the host cell by non-homologous recombination. 

30 

For autonomous replication, the vector may further comprise an origin of replication ena- 
bling the vector to replicate autonomously in the host cell in question. Examples of bacte- 
rial origins of replication are the origins of replication of plasmids pBR322, pUC19, pA- 
CYC177, and pACYC184 permitting replication in E. co//, and pUB110, pE194, pTA1060, 
35 and pAMB1 permitting replication in Bacillus. Examples of origins of replication for use in a 
yeast host cell are the 2 micron origin of replication, ARS1, ARS4, the combination of 
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ARS1 and CEN3, and the combination of ARS4 and CEN6. The origin of replication may 
be one having a mutation which makes it's functioning temperature-sensitive in the host 
cell (see, e.g., Ehrlich (1978) Proceedings of the National Academy of Sciences USA 
75:1433). 

5 

More than one copy of a nucleotide sequence of the present invention may be inserted into 
the host cell to increase production of the gene product. An increase in the copy number of 
the nucleotide sequence can be obtained by integrating at least one additional copy of the 
sequence into the host cell genome or by including an amplifiable selectable marker gene 

10 with the nucleotide sequence where cells containing amplified copies of the selectable 
marker gene, and thereby additional copies of the nucleotide sequence, can be selected 
for by cultivating the cells in the presence of the appropriate selectable agent. 
The procedures used to ligate the elements described above to construct the recombinant 
expression vectors of the present invention are well known to one skilled in the art (see 

15 e.g. Sambrook et al. (1989) supra). 

Recombinant hos t cell comprising nucleic acid construct 

The present invention also relates to recombinant a host cell comprising the nucleic acid 
construct of the invention, which are advantageously used in the recombinant production 
20 of the polypeptides. A vector comprising a nucleotide sequence of the present invention is 
introduced into a host cell so that the vector is maintained as a chromosomal integrant or 
as a self-replicating extra-chromosomal vector as described earlier. 

The host cell may be a unicellular microorganism, such as a prokaryote or a non- 
25 unicellular microorganism, such as a eukaryote. Useful unicellular cells are bacterial cells 
such as gram positive bacteria including, but not limited to, a Bacillus cell, e.g., Bacillus 
alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans. Bacillus clausii, 
Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis. Bacillus 
megaterium, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis; or a 
3 0 Streptomyces cell, e.g., Streptomyces lividans or Streptomyces murinus, or gram negative 
bacteria such as E coli and Pseudomonas sp. In a preferred embodiment, the bacterial 
host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophitus, or Bacillus 
subtilis cell. In another preferred embodiment, the Bacillus cell is an alkalophilic Bacillus. 
The introduction of a vector into a bacterial host cell may, for instance, be effected by pro- 
35 toplast transformation (see, e.g., Chang and Cohen (1979) Molecular General Genetics 
168:1 11-1 15), using competent cells (see, e.g., Young and Spizizin (1961) Journal of Bac- 
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teriology 81:823-829, or Dubnau and Davidoff-Abelson (1971) Journal of Molecular Biology 
56:209-221), etectroporation (see, e.g., Shigekawa and Dower (1988) Biotechniques 
6:742-751), or conjugation (see, e.g., Koehler and Thorne (1987) Journal of Bacteriology 
169:5771-5778). 

5 

The host cell may be a eukaryote, such as a mammalian, insect, plant, or fungal cell. In a 
preferred embodiment, the host cell is a fungal celL "Fungi" as used herein includes the 
phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as defined by 
Hawksworth et al. (1995) In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition, 

10 CAB International, University Press, Cambridge, UK) as well as the Oomycota (as cited in 
Hawksworth et al. (1995) supra) and ail mitosporic fungi (Hawksworth et al. (1995) supra). 
In a more preferred embodiment, the fungal host cell is a yeast cell. "Yeast" as used herein 
includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast 
belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of yeast may 

15 change in the future, for the purposes of this invention, yeast shall be defined as described 
in Biology and Activities of Yeast (Skinner, Passmore and Davenport, eds, (1980) Soc. 
App. Bacterid. Symposium Series No. 9). 

In an even more preferred embodiment, the yeast host cell is a Candida, Hansenula, Kluy- 
veromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell. In a most 
20 preferred embodiment, the yeast host cell is a Saccharomyces carlsbergensis, Saccharo- 
myces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasfi, Saccharomyces 
kluyveri, Saccharomyces norbensis or Saccharomyces oviformis cell. In another most pre- 
ferred embodiment, the yeast host cell is a Kluyveromyces lactis cell In another most pre- 
ferred embodiment, the yeast host cell is a Yarrowia lipoiytica cell. 

25 

In another more preferred embodiment, the fungal host cell is a filamentous fungal cell. 
"Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomy- 
cota (as defined by Hawksworth et al. (1995) supra). The filamentous fungi are character- 
ized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, rnannan, and other 
3 0 complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabo- 
t lism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces 
cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermenta- 
tive. 

35 In an even more preferred embodiment, the filamentous fungal host cell is a cell of a spe- 
cies of, but not limited to, Acremonium, Aspergillus, Fusarium, Humicola, Mucor t My- 
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celiophthora, Neurospora, Penicillium, Thielavia, Tolypocladium, or Trichoderma. In a most 
preferred embodiment, the filamentous fungal host cell is an Aspergillus awamori, Asper- 
gillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger or Aspergillus 
oryzae cell. In another most preferred embodiment, the filamentous fungal host cell is a 
5 Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, 
Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, 
Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, 
Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum t Fusarium toru- 
losum, Fusarium trichothecioides, or Fusarium venenatum cell. In an even most preferred 

10 embodiment, the filamentous fungal parent cell is a Fusarium venenatum cell (Nirenberg 
sp. nov. such as the Fusarium venenatum deposited under Nos. CBS 458.93, CBS 
127.95, CBS 128.95, CBS 148.95). In another most preferred embodiment, the filamen- 
tous fungal host cell is a Humicola insolens, Humicota lanuginosa, Mucor miehei, My- 
celiophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Thielavia ter- 

15 restris t Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, 
Trichoderma reesei, or Trichoderma viride cell. 

Fungal cells may be transformed by a process involving protoplast formation, transforma- 
tion of the protoplasts, and regeneration of the cell wall in a manner known perse. Suitable 
20 procedures for transformation of Aspergillus host cells are described in EP 238 023 and 
Yelton et al. (1984) Proceedings of the National Academy of Sciences USA 81:1470-1474. 
Suitable methods for transforming Fusarium species are described by Malardier et al. 
(1989) Gene 78:147-156 and WO 96/00787. Yeast may be transformed using the proce- 
dures described by Becker and Guarente, In Abelson and Simon, eds, Guide to Yeast Ge- 

2 5 netics and M olecular B iology, Methods in Enzymology, Volume 1 94:182-187, Academic 

Press, Inc., New York; Ito et al. (1983) Journal of Bacteriology 153:163; and Hinnen et al. 
(1978) Proceedings of the National Academy of Sciences USA 75:1920. 

Processes for preparing functional CBM's 

3 0 The present invention also relates to methods for producing a CBM of the present inven- 

tion comprising (a) cultivating a strain, which in its wild-type form is capable of producing 
the CBM; and (b) recovering the CBM. Preferably, the strain is a fungus, more preferably 
of the genus Humicola, particularly Humicola insolens or Coprinus. such as Coprinus 
cinereus or Thielavia such as Thielavia terrestris or Aspergillus such as Aspergillus oryzae. 

35 

The present invention also relates to a method for producing a CBM polypeptide, the 



10499.000-DK 



32 

method comprising the steps of 

- growing under conditions to overproduce CBM's in a nutrient medium Aspergillus host 
cells which have been transformed with an expression cassette which includes, as opera- 
bly joined components, 
5 a) a transcriptional and translational initiation regulatory region, 

b) a DNA sequence encoding the CBM polypeptide, 

c) a transcriptional and translational termination regulatory region, wherein the regulatory 
regions are functional in the host, and 

d) a selection marker gene for selecting transformed host cells; and 
1 0 recovering the CBM polypeptide. 

In the production methods of the present invention, the cells are cultivated in a nutrient 
medium suitable for production of the CBM using methods known in the art. For example, 
the cell may be cultivated by shake flask cultivation, small-scale or large-scale fermenta- 

15 tion (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or 
industrial fermentors performed in a suitable medium and under conditions allowing the 
polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutri- 
ent medium comprising carbon and nitrogen sources and inorganic salts, using procedures 
known in the art. Suitable media are available from commercial suppliers or may be pre- 

20 pared according to published compositions (e.g., in catalogues of the American Type Cul- 
ture Collection). If the CBM is secreted into the nutrient medium, the CBM can be recov- 
ered directly from the medium. If the CBM is not secreted, it can be recovered from cell 
lysates. 

25 The produced CBM may be detected using methods known in the art and modifications 
thereof that are specific for the CBM. These detection methods may include use of specific 
antibodies or determination of binding to a carbohydrate substrate, such as Avicel. 

The resulting CBM may be recovered by methods known in the art. For example, the CBM 
30 may be recovered from the nutrient medium by conventional procedures including, but not 
limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. 
The polypeptides of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, 
hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., 
35 preparative isoelectric focusing), differential solubility (e.g. ammonium sulfate precipita- 
tion), SDS-PAGE, or extraction (see, e.g. Protein Purification, Janson and Ryden, eds 
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(1989) VCH Publishers, New York). 

MATERIALS AND METHODS 

Determination CBM activity 

A DNA sequence encoding a CBM from a given organism can be obtained conventionally 
by using PCR techniques, and, also based on current knowledge it is possible to find ho- 
mologous sequences from other organisms. 

It is contemplated that new CBM's can be found by cloning carbohydrate degrading en- 
zymes such as cellulases, xylanases or other plant cell wall degrading enzyme and meas- 
ure the binding to the target carbohydrate. Traditionally it has been assumed, that if the 
enzyme activity binds to the crystalline cellulose product Avicel® part of the gene codes for 
a cellulose-binding domain. The binding to Avicel® is tested under the standard conditions 
described below. 

Cellulose affinity can be measured by using 10 g of Avicel® in a 500 ml buffered slurry 
(buffer: 0.1 sodium phosphate, pH 7.5) which is stirred slowly using a spoon and left swell- 
ing for 30 minutes at room temperature. Then the enzyme is added in a ratio of 1 part cel- 
lulose binding domain to 150 parts Avicel®. This is done on ice which gives optimum bind- 
ing within 5 to 10 minutes. The Avicel® can then be washed and applied directly to SDS- 
PAGE for visualization of the bound proteins (since the use of SDS and cooking will re- 
lease the bound proteins). Alternatively, the slurry is packed into a column and washed. 
The bound protein is eluted f either in ionized water or in a high pH buffer such as triethyl- 
amine (pH 11.2; 1% solution), where the pH eluted protein is quickly adjusted- to neutral. 

General molecular biology methods 

DNA manipulations and transformations are performed using standard methods of molecular 
biology as described in Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Lab., Cold Spring Harbor, NY (1989); Ausubel et al. (Eds.), Current Protocols 
in Molecular Biology, John Wiley and Sons (1995); Harwood and Cutting (Eds.) Molecular 
Biological Methods for Bacillus, John Wiley and Sons (1990). 

Enzymes for DNA manipulations were used according to the specifications of the suppliers. 
EXAMPLE 1 

Construction of an Aspergillus expression vector for a CBM domain from Pseudo- 
ptectenia nigrella which domain was secreted with a family GH61 enzyme. 
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Expression constructs of SEQ ID NO:1 were created by two different cloning procedures. The 
first procedure (A) uses inverse PCR to delete the enzymatic core of the family GH61 enzyme 
obtained from Pseudoplectania nigretta. Ligation of the product resulted in a plasmid contain- 
ing the native secretion signal of the GH61 enzyme, fused in frame to the DNA encoding the 
5 carbohydrate-binding module. The second method (B) pursued for recombinant overexpres- 
sion of the CBM of the invention was to clone the DNA encoding the CBM domain into a vec- 
tor containing a Candida lipase signal peptide. 

A - Inverse PCR 

10 Primers NP887U1 and NP887D1 were synthesized as 5' phosphorylated primers. Amplifica- 
tion of plasmid DNA encoding the full open reading frame (ORF) of the family GH61 enzyme 
was used as template. The OFR can be obtained from the deposited strain CBS 444.97 by 
use of primers NP887U1 and NP887D1. Approximately 100 nanograms of DNA were used 
as template in a PCR reaction with the two primers A and B. 

15 

NP887U1 (SEQ ID NO:3) 

5-GACATCGTTGACGGAGAGTCCGTAGACACGA-3 1 

NP887D1 (SEQ ID NO:4> 

2 0 5 , -ACATCCTCCGGCACCTCCAATGACAAGGCCGTCG-3 , 

The protocol followed was the basic protocol of the Excite PCR mutagensis as described in 
the Stratagene (USA) user manual Catalogue number 200502. 

25 Component 

dH 2 0 32J5pl 

10XPfuUltra buffer 5.0pl 

dNTPs (10mM stock) 1,25pl 

NP887U1 (100pMol/pl) 5,0pl 

3 0 NP887D1 (100pMol/pl) 5,0pl 

1 pi of PfuUltra (Stratagene, USA) was added to the reaction and then the sample was 
placed in a thermal cycler under the following conditions: 

95 degrees Celsius 2 min 
3 5 followed by 20 cycles of 

95 degrees Celsius 2 minutes 
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55 degrees Celsius 
72 degrees Celsius 



30 seconds 



7 minutes 



5pl of the PCR reaction was removed for agarose gel analysis to confirm amplification of a 
5 band of the expected size (ca, 6 Kb). To the remaining 45pl, 40 units of Dpnt restriction 
enzyme were added. The sample was returned to the thermal cycler and incubated at 37 
degrees Celsius for 30 minutes followed by 65 degrees Celsius for 30 minutes. The treated 
sample was purified by the GFX column purification kit according to the manufacturer's 
instructions (Amersham Biosciences, USA). 



0.5jjI New England Biolabs T4 DNA ligase (ca. 200 NEB cohesive end units) 

15 The ligation was performed at 17 degrees Celsius overnight and then transformed into 

TOP10 Chemically competent cells according to the manufacturers instructions. The trans- 
formation was plated out on LB agar with 50mg/liter ampicillin. Eleven of the several hun- 
dred colonies that grew were miniprepped (Qiaspin® columns, Qiagen Ltd.) and the DNA 
cut with EcoRI and Not! to liberate the insert. Eight out of the eleven plasmids had an in- 

20 sert of the correct size (ca. 700bp). The insert was sequenced for these plasmids contain- 
ing inserts. The colonies were sequenced with vector primer PNA2I (5-GTT TCC AAC 
TCA ATT TAC CTC-3' SEQ ID NO:5). It was determined that no errors were introduced in 
any of the insert sequences as a result of PCR. Of the eight plasmids pCBMX-K1 was 
chosen for a medium scale JetStar^ (GENOMED, Germany) plasmid preparation from 

25 100jj! of LB ampicillin grown plasmid containing E. coli cells. 

The DNA sequence of the fusion construction of pCBMX-K1, and the corresponding amino 
acid sequence, are shown in SEQ ID NO:1 and SEQ ID NO:2, respectively. 

Transformation of construct pCBMX-K1 into Aspergillus orvzae 

30 The DNA of SEQ ID NO:1 was transformed into Aspergillus oryzae strain JAL355 (dis- 
closed in intemationaf patent application WO 01/98484A1). Transformants of SEQ ID NO:1 
was re-isolated twice under selective and non-inducing conditions on Cove minimal plates 
(Cove (1966) Biochim. Biophys. Acta 133:51-56) with 1M sucrose as a carbon source and 
10mM nitrate. To test expression of SEQ ID NO:1 r transformants were grown for 3 days 

35 and 4 days at 30 degrees Celsius in tubes with 10 ml YPM (2% peptone. 1% yeast extract, 
2% maltose). Supernatants were run on NuPage® 10% Bis-Tris SDS gels (Invitrogen, 



10 



treated PCR sample 
10X ligation buffer 



9pl 
1 Ml 



10499.000-DK 



36 

USA) as recommended by the manufacturer. All Aspergillus isolates grew well even when 
induced for the expression of the DNA of SEQ ID NO:1 . 

B - Construction and expression of CBM using a Candida antarctlca lipase signal peptide 
5 The second method pursued for recombinant overexpression of the CBM domain of the 
invention was to clone the CBM domain into a specially prepared vector containing the 
necessary Aspergillus regulatory elements (promoter, terminator, etc.) and a signal peptide 
with signal cleavage site of a secreted lipase gene from Candida antarctica. Cloning of the 
PCR product consisting of the CBM domain into the vector allows for an in frame fusion of 
10 the CBM domain with the signal peptide. Expression of the construct in Aspergillus oryzae 
should result in efficient secretion of the enzyme due to the presence of the provided se- 
cretion signal and cleavage site. The vector used, pDau109 is a derivative of pJAL721, 
which is described in WO 03/008575. 

The plasmid pDau109 differs from pJaL721 in that the ampicillin resistance gene has been 
15 inserted into the pyrG selectable marker. The improvements made in pDau109 vector are 
first, the selection marker URA3 of E. coli that has been replaced by a URA3 gene dis- 
rupted by the insertion of the ampicillin resistance gene E. coli beta lactamase. This fea- 
ture allows for facile selection for positive recombinant E. coli clones using commercially 
available and highly competent strains on commonly used LB ampicillin plates. Further- 

2 0 more the Ampicillin resistance gene is entirely removable using the two flanking Notl sites 

restoring a functional selection marker URA3. In addition, pDau109 has a Candida antarc- 
tica lipase (SW ALL: LI PB_CAN AR) signal sequence (amino acids 1-57 of SEQ ID NO:9) 
and cleavage site introduced after the fungal promoter in which a number of convenient 
cloning sites are available for in frame fusions of a supplied coding region with the C. ant- 
25 arctica secretion signal. Specifically, pDau109 has 8 unique restriction sites that can be 
used to insert a cDNA (BstXI-Fspl-Spel-Nrul-Xcml-Hindlll-Xhol). Standard methods were 
used for modification of pJaL724 into pDau109. 

Plasmid of pDau109 was prepared by medium scale Qiagen®midi plasmid preparation 

3 0 (Qiagen) from 100mts of LB ampicillin grown plasmid containing E. coli cells. 

Generation of the CBM domain with Hindi II sites 

The following primers were used in a standard PCR reaction, the Hindlll restriction sites 
introduced for cloning purposes are underlined: 
35 NP887Dau1 (SEQ ID NO:6) 

5- CCAAAGCTTT TCATCCTCCGGCACCTCCAATG-3' 
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NP887Dau2 (SEQ ID NO:7) 

5 , -GCGAAGCTTA ATCTTACTCCATCTCACCTCCC■3 , 

Plasmid NP887-1 encoding the GH61 coding region was used as PCR template. 

5 

pNP887 DNA(100ng) 1pl 
1 0X Proof Start buffer 5pl 
dNTP 20mM 0.75|jl 
NP887Dau1(100pMol/pl) 0.5pl 
1 o NP887Dau2(100pMol/pl) 0.5pl 
H z O 41.25pl 
ProofStart® DNA polymerase (Qiagen Ltd) 1 \i\ 

The sample was transferred to a thermal cyder and the following program run: 
95 degrees Celsius 5 min 
15 Then 20 cycles of 

94 degrees Celsius 30 seconds 
60 degrees Celsius 30 seconds 
72 degrees Celsius 1 minute 

20 Five pi of the PCR reaction was inspected on an Agarose gel and a band corresponding to 
the correct size (ca. 500bp) was observed. The remaining 45 pi of sample was purified by 
the GFX purification method (Amersham Biosciences). The sample was then restricted 
with Hindlll under standard conditions (40units/pg DNA, 37 degrees Celsius) overnight. 
The treated fragment (NP887-CBM) was once more GFX purified and stored at -20 de- 

25 grees Celsius until further use. 

Preparation of pDau109 

pDau109 plasmid DNA was restricted with Hindlll for four hours under standard conditions. 
10 units of Shrimp Alkaline Phosphatase was added to the reaction and the incubation at 
30 37 degrees Celsius continued for an additional 2 hours. The sample was then heat treated 
at 65 degrees Celsius for 20 minutes to inactivate the enzyme. The treated plasmid was 
GFX purified and stored at -20 degrees Celsius until later use. 

Ligation 

35 Vector pDau109 Hindlll* 1pl (ca. 90ng) 

NP887-CBM PCR frag-Hindlll 7pl 
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T4 DNA ligase buffer (NEB) 1 pi 
NEB T4 DNA ligase 0.3pl 

The ligation was performed at 16 degrees Celsius overnight and then stored in -20 degree 
Celsius until used. 

5 

Transformation 

The ligation was performed at 16 degrees Celsius overnight and then transformed into 
TOP10 Chemically competent cells according to the manufacturers instructions. The trans- 
formation was plated out on LB agar with 50mg/liter ampicillin. Twelve of the several hun- 

10 dred colonies that grew were miniprepped (Qiaspin® columns, Qiagen Ltd.) and the DNA 
cut Hindlll to liberate the insert. Four out of the twelve plasmids had an insert of the correct 
size (ca. 500bp). The plasmids containing inserts were sequenced with vector primer 
PNA2I (SEQ I D NO:5) to determine integrity and orientation of the insert. It was deter- 
mined that no errors were introduced in any of the insert sequences as a result of PCR. 

15 Plasmid pCBMX-S1 was found to be error free and in the correct orientation and was 
therefore chosen for a medium scale Qiagen® midi plasmid preparation (Qiagen) from 
100pl of LB ampicillin grown plasmid containing E. coli cells. The DNA sequence of the 
fusion construct pCBMX-S1 , and the corresponding amino acid sequence, are shown in 
SEQ ID NO:8 and SEQ ID NO:9 f respectively. 

20 

Transformation of construct pCBMX-S1 into Aspergillus orvzae: 

The fusion construct pCBMX-S1 (SEQ ID NO:8) was transformed into Aspergillus oryzae 
strain BECh2, which was constructed as described in WO 00/39322 (BECh2 is derived 
from strain Aspergillus oryzae JaL228, which is constructed on the basis of the deposited 
2 5 strain Aspergillus oryzae !FO 41 77 as described in WO 98/1 2300). 

Transformation media AMDS media: 
Agarose 20g 

Cove salt 20ml (Cove (1966) supra) 

30* Sucrose 342 g 

dH20 to 100ml 

* autoclave at 121 degrees Celsius for 20 minutes allow to cool then add: 
1M acetamide 10ml 
IMCsCI 15ml 

35 

AMDS media for re-isolation of transformants: The same as above but without added CsCI 
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and adding 10Qpl triton-X100 per 1000ml media. 

Transformants of pCBMX-S1 were re-isolated twice on Cove sucrose media. (Cove (1966) 
Biochtm. Biophys. Acta 133:51-56) with 1M sucrose as a carbon source and 10rnM nitrate. 
To test expression of the fusion construct pCBMX-S1, which contains the Candida antarc- 
5 tica lipase (SWALL:LIPB_CANAR) signal sequence and the P. nigrella CBM polypeptide of 
the invention (SEQ ID NO:8), 23 transformants were grown for 3 days and 4 days at 30 
degrees Celsius in tubes with 10 ml YPG (2% peptone. 1% yeast extract, 2% glucose). 
Supernatants were run on NuPage® 10% Bis-Tris SDS gels (Invitrogen, USA) as recom- 
mended by the manufacturer. SYPRO* 1 Orange Gel staining was used according to the 

10 manufacturer's instructions (Molecular Probes, USA). Three of the isolates revealed a dif- 
fuse band between 35-45 kDa on the SDS gel. These were analyzed further with various 
150 ml shake flask media fermentations. 1000ml Erlenmeyer flasks with side baffles was 
used with 150mls of each of 4 different media: 
YPM : 2% peptone, 1% yeast extract, 2% maltose 

15 YPG : 2% peptone, 1% yeast extract, 2% glucose 

DAP2C : For 1 liter media; MgS04.7H20 (Merck 5886) 11g, KH2P04 (Merck 4873) 1 
g, Citric Acid (Merck 244) 2 g, Maltodextrin (Roquette), K3P04.H20 5.2 g, Yeast ex- 
tract (Difco 0127) 0>5g, AMG spore metals 0.5mls (Zink Chloride: Merck 8816, 6.8g; 
Copper Sulphate: Merck 2790, 2.5g; Nickel Chloride: Merck 6717, 0.24g; Iron sul- 

20 phate: Merck 3965, 13.9g; Manganese sulphate: Merck 5941, 8.45g; Citric acid: Merck 
0244, 3g), Pluronic® PE 6100 (BASF). 

FG4P : 3% Soybean meal (SFK 102-2458), 1.5% Maltodextrin (Roquette), 0.5% Pep- 
tone bacto (Difco 0118), 1.5% KH2P04 (Merck 4873), 0.2mls/liter Pluronic® PE 6100 
(BASF). 

25 A heavy inoculum of several thousand spores was used for each and the shake flasks 
were agitated on an orbital shaker at 150 RPM at 30 degrees C. 

Aspergillus isolates grew well even when induced for the expression of the CBM polypep- 
tide of the invention. 

30 EXAMPLE 2 

■ Purification of SEQ ID NO:9 from expression of SEQ ID NO:8 in Aspergillus. 

The Aspergillus oryzae strain described in Example 1 B expressing the CBM (CBMX) of the 
Pseudoplectania nigrella GH61 with Candida antarctica lipase signal peptide was grown in 
shake flasks. About 1 liter culture broth was sterile filtered and the filtrate loaded onto a 
35 column containing 50 g Avicel. Non-binding and weakly binding proteins were removed by 
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washing with Milli-Q® water. Proteins with affinity for Avicel were eluted with 0.1 M Tris, pH 
11.5. Immediately after elution, pH of this Avicel-binding fraction was adjusted to 7.5, and 
the fraction was concentrated using an Amicon® cell (Millipore® USA) with a membrane 
having a cut-off of 6 kDa. On SDS-PAGE the majority of the protein binding to Avicel ap- 
5 peared as a broad band of molecular weight 35-45 kDa, which is considerably higher than 
the molecular weight of the protein part of the carbohydrate binding module. The high and 
heterogeneous molecular weight is probably due to heterogeneity in O- and N- 
glycosylation of the N-terminal part of the protein. N-terminal sequencing of the 35-45 kDa 
band gave exclusively the sequence SFSSSGT (positions 47-53 of SEQ ID NO:9) indicat- 
10 ing that heterogeneity in the N-terminal amino acid sequence is not present. 

EXAMPLE 3 

Specificity of binding of purified CBMX 

The carbohydrate-binding domain with affinity for Avicel and purified as described in Ex- 
15 ample 2 (CBMX) was studied further. 50 pi purified CBMX was mixed with 500 pi 20 mM 
Tris, pH 7.5 containing varying amount of Avicel (0-100 mg/ml) in an Eppendorf tube. After 
4 hours incubation at room temperature with agitation, the samples were centrifuged. 200 
pi supernatant was transferred to the well of a microtiter plate (Costar, UV plate) and ab- 
sorbance read at 280 nm on a microtiter plate reader (SpectraMax® Plus, Molecular De- 
20 vices). The results in Table 1 indicate that the large majority of the protein binds to the 
highest concentrations of Avicel. 

Table 1 : Binding of CBMX to Avicel. A280: Absorbance at 280 nm of 200 pi supernatant in 
microtiter plate with absorbance of buffer and Avicel subtracted. 



Avicel (mg/ml) 


A280 


100 


0.0095 


50 


0.0185 


25 


0.0326 


12.5 


0.0540 


6.25 


0.0637 


3.125 


0.0729 j 


1.563 


0.0799 


0.781 


0.0808 


0.391 


0.0777 


0.0 


0.0768 
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Experiments with shorter incubation time (15 min to 1 hour) gave less complete binding. 

A similar binding study was performed with PASC (Phosphoric Acid Swollen Cellulose: To 
5 g Avicel moisted with water 150 ml ice-cold 85 ortho-phosphoric acid is added. After 1 
5 hour stirring on ice bath, 500 ml cold acetone is added. The suspension is filtered and 
washed, first with acetone and then with water). 50 |jl purified CBMX was mixed with 500 
pi 20 mM Tris, pH 7.5 containing varying amount of PASC (0-10 mg/ml) in an Eppendorf 
tube. Samples were incubated 4 hours at room temperature with agitation. After centrifuga- 
tion, 200 pi supernatant was transferred to the well of a microtiter plate and absorbance 
10 read at 280 nm. The results in Table 2 show that increasing amount of PASC reduces the 
amount of absorbance of CBMX in the supernatant, i.e. CBMX has affinity for PASC. 

Table 2: Binding of CBMX to PASC. A280: Absorbance at 280 nm of 200 pi supernatant in 
microtiter plate with absorbance of buffer subtracted. 



PASC (mg/ml) 


A280 


10 


0.0471 


5 


0.0512 


2.5 


0.0618 


1.25 


0.0710 


0.625 


0.0682 


0.0 


0.0753 



15 

Affinity of CBMX for a number of soluble carbohydrates was tested in a competition assay 
by mixing 100 pi CBMX with both Avicel (400 pi 50 mg/m! in 20 mM Tris, pH 7.5) and the 
soluble carbohydrate (dissolved in 500 pi 20 mM Tris, pH 7.5). As references, samples 
without CBMX or soluble carbohydrate added were used. If CBMX has affinity for the solu- 
20 ble carbohydrate it should be able to keep CBMX in solution which can be measured as 
increase in absorbance at 280 nm compared to sample without soluble carbohydrate 
added. After 4 hours incubation at room temperature with agitation, samples were centri- 
fuged and absorbance at 280 nm was read using 200 pi supernatant in the well of a micro- 
< titer UV plate. 

25 Tested soluble carbohydrates were barley B-glucan (Megazyme, low viscosity), iichenan 
(Megazyme, Icelandic moss), CMC (carboxymethyl cellulose 7LF, Hercules, USA) t Xy- 
loglucan (Megazyme, amyloid, from tamarind seed), lupin galactan (Megazyme) and Lo- 
cust bean gum (Sigma, G-0753). 
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From the results in Table 3 it is seen that beta-glucan and CMC are able to keep almost all 
CBMX in solution. Also locust bean gum keeps the majority in solution, whereas xylogiu- 
can and galactan result in about half of the CBMX in solution. Addition of lichenan does not 
result in any increase in CBMX in solution. These results indicate affinity of CBMX for beta- 
glucan, CMC and locust bean gum and to some extent also for xyloglucan and galactan, 
whereas no affinity for lichenan could be detected. 

Table 3: Competition binding assay with CBMX, Avicel (20 mg/ml) and soluble carbohy- 
drates. Concentration of carbohydrate: Concentration of soluble carbohydrate during incu- 
bation with CBMX and Avicel. Difference in A280: Difference in absorbance at 280 nm be- 
tween samples with and without CBMX added. 



Soluble carbohydrate 


Carbohydrate cone, (mg/ml) 


Difference in A280 


None - Only Avicel and CBMX 




0.018 


None - Only CBMX 




0.108 


beta-Glucan 


20 


0.100 


Lichenan 


20 


0.025 


CMC 


25 


0.129 


Xyloglucan 


11 


0.052 


Galactan 


20 


0.043 


Locust bean gum 


25 


0.087 
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CLAIMS 

1. A carbohydrate-binding module which is 

(a) a polypeptide encoded by the DNA sequence of positions 109-531 of SEQ ID NO;1, or 
5 a DNA sequence homologous to SEQ ID NO:1, which DNA sequence has at least 50% 

identity with positions 109-531 of SEQ ID NO:1 or 

(b) a polypeptide produced by culturtng a ceil comprising the DNA sequence of positions 
109-531 of SEQ ID NQ:1 under conditions wherein the DNA sequence Is expressed or 

(c) a polypeptide having the amino acid sequence of positions 34-174 of SEQ ID NO:2 » or 
10 a polypeptide homologous to SEQ ID NO:2, which polypeptide has an amino acid se- 
quence of at least 50% identity with positions 34-174 of SEQ ID NO:2 or 

(d) a polypeptide encoded by a DNA sequence that hybridizes to the DNA sequence of 
positions 109-531 of SEQ ID NO:1 under low stringency conditions or 

(e) a polypeptide encoded by an isolated polynucleotide molecule which polynucleotide 
15 molecule hybridizes to a denatured double-stranded DNA probe under low stringency con- 
ditions, wherein the probe is selected from the group consisting of DNA probes comprising 
the sequence shown in positions 109-531 of SEQ ID NO:1. and DNA probes comprising a 
subsequence of positions 109-531 of SEQ ID NO:1. the subsequence having a length of at 
least about 300 base pairs. 

20 

2. The carbohydrate-binding module of claim 1 , which is encoded by a DNA sequence ob- 
tainable from Pseudoptectanta nigrelta CBS 444.97. 

3. An isolated polynucleotide molecule encoding a polypeptide having carbohydrate-binding 

2 5 module activity selected from the group consisting of. 

(a) polynucleotide molecules comprising a nucleotide sequence as shown in SEQ ID NO:1 
from nucleotide 109 to nucleotide 531; 

(b) polynucleotide molecules that encode a polypeptide that is more than 50% identical to the 
amino add sequence of positions 34-174 of SEQ ID NO:2; or a fragment thereof that has 

3 0 carbohydrate-binding module activity; 

(c) molecules complementary to (a) or (b); and 

(d) degenerate nucleotide sequences of (a) or (b). 

4. The isolated polynucleotide molecule according to claim 3, wherein the polynucleotide Is 
3 5 DNA. 
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5. An isolated polynucleotide molecule encoding a polypeptide having carbohydrate-binding 
module activity which polynucleotide molecule hybridizes to a denatured double-stranded 
DNA probe under low stringency conditions, wherein the probe is selected from the group 
consisting of DNA probes comprising the sequence shown in positions 109-531 of SEQ ID 

5 NO:1 and DNA probes comprising a subsequence of positions 109-531 of SEQ ID NO:1 hav- 
ing a length of at least about 300 base pairs. 

6. The isolated polynucleotide molecule according to claim 3 which is isolated from or pro- 
duced on the basis of a DNA library from a prokaryote, such as a bacterium or an eukaryote, 

1 o such as a fungus or yeast. 

7. The isolated polynucleotide molecule according to claim 6 which is isolated from or pro- 
duced on the basis of a DNA library from a strain of Pseudoptectania, preferably the strain 
Pseudoplectania nigrella CBS 444.97. 

15 

8. A polynucleotide construct comprising the polynucleotide molecule according to any of 
claims 3-7. 

9. The polynucleotide construct of claim 8 comprising one or more control sequences, such 
20 as a promoter, a leader sequence, a polyadenylation sequence, a signal peptide, a propep- 
tide and a transcription terminator sequence . 

10. An expression vector comprising the following operably linked elements: a transcription 
promoter, a DNA segment selected from the group consisting of (a) polynucleotide molecules 
encoding a polypeptide having carbohydrate-binding module activity comprising a nucleotide 
sequence as shown in SEQ ID NO:1 from nucleotide 109-531, (b) polynucleotide molecules 
encoding a polypeptide having carbohydrate-binding module activity that is nriore than 50% 
identical to the amino acid sequence of position 34-174 of SEQ ID NO:2 or a fragment thereof 
that has carbohydrate-binding module activity; and (c) degenerate nucleotide sequences of 
(a) or (b); and a transcription terminator. 

1 1 . A cultured cell into which has been introduced an expression vector according to claim 
10, wherein said cell expresses the polypeptide encoded by the DNA segment 

3 5 12. The cell according to daim 1 1 , which is an eukaryotic cell, in particular a fungal cell, or an 
endogenous cell from which the DNA segment, encoding the polypeptide exhibiting endo- 



25 



30 
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beta-1,4-glucanase activity, originates. 

13. The cell according to claim 12, wherein the cell belongs to a strain of Aspergillus, prefera- 
bly a strain of Aspergillus oryzae, preferably the strain Aspergillus oryzae BECh2. 

5 

14. A method of producing a polypeptide having carbohydrate-binding module activity com- 
prising culturing a cell according to claim 11, whereby said cell expresses a polypeptide en- 
coded by the DNA segment; and recovering the polypeptide. 

10 15. An isolated polypeptide having carbohydrate-binding module activity, in which the poly- 
peptide is (i) free from homologous impurities, and (ii) produced by the method according to 
claim 14. 

16. A composition comprising a CBM according to claims 1, 2 or 15. 

15 

17. The composition of claim 16 further comprising one or more enzymes selected from the 
group consisting of proteases, cellulases, beta-glucanases, hemicellulases, lipases, peroxi- 
dases, laccases, alpha-amylases, glucoamylases, cutinases, pectinases, reductases, oxi- 
dases, phenoloxidases, ligninases, pullulanases, pectate lyases, xyloglucanases, xylanases, 

20 pectin acetyl esterases, polygalacturonases, rhamnogalacturonases, pectin lyases, other 
mannanases, pectin methylesterases, cellobiohydrolases, transglutaminases; or mixtures 
thereof. 

18. A method for degradation of cellulose-containing biomass, wherein the biomass is treated 
25 with an effective amount of the carbohydrate-binding module according to any of claims 1-2 

and 15 or of the composition according to claims 16 or 17. 

19. A hybrid endo-glucanase, exhibiting endo-beta-1,4-glucanase activity comprising a CBD 
according to claims 1 , 2 or 1 5 and a catalytic domain. 

30 

20. A composition comprising a carbohydrate-binding module according to claims 1, 2 or 15 
or the hybrid endo-glucanase of claim 19. 

21. Use of a carbohydrate-binding module according to claims 1, 2 or 15 or the hybrid endo- 
3 5 glucanase of claim 1 9 in a detergent composition. 
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22. Use of a carbohydrate-binding module according to any of claims to claims 1, 2 or 15 or 
the hybrid endo-glucanase of claim 19 in textile finishing processes. 

23. Use of a carbohydrate-binding module according to any of claims to claims 1, 2 or 15 for 
5 purification of polypeptides. 

24. Use of a carbohydrate-binding module according to any of claims to claims 1, 2 or 15 for 
immobilisation of active enzymes. 

10 25. Use of a carbohydrate-binding module according to any of claims to claims 1 , 2 or 15 for 
baking. 

26. Use of a carbohydrate-binding module according to any of claims to claims 1, 2 or 15 for 
manufacturing of biofuel. 

15 

27. Use of a carbohydrate-binding module according to any of claims to claims 1, 2 or 15 for 
modification of plant cell walls. 

28. Use of a carbohydrate-binding module according to any of claims to claims 1, 2 or 15 for 
2 0 processing of cellulose fibre. 
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ABSTRACT 

The present invention relates to non-catalytic carbohydrate-binding modules (CBM) belonging 
to a new family of CBM's. A CBM of the invention was found attached to a glycosyl hydrolase 
family 61 (GH61) polypeptide and was shown to have little homology with known CBM's indi- 
5 eating that it is the first known member of a new family of CBM's. The present invention fur- 
ther relates to CBM's preferably exhibiting binding affinity for cellulose; to a method of produc- 
ing such CBM's; and to methods for using such CBM's in the textile, detergent and cellulose 
fiber processing industries, for purification of polypeptides, immobilisation of active enzymes, 
baking, manufacturing of biofuel, modification of plant cell walls. 
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Patent- og 
Varemaerkertyrelsen 

SEQUENCE LISTING 

3 0 OKL 2003 

<110> Novozymes A/S 

Modtaget 

<120> Fungal carbohydrate -binding module 
<130> 10499. OOO-DK 
<160> 9 

<170> Patentln version 3.2 

<210> 1 

<211> 629 

<212> DNA 

<213> Pseudoplectania nigrella 
<220> 

<221> CDS 

<222> (10).. (531) 

<400> 1 

gaattcaaa atg gtc aac ttc acc acc etc etc ccg gtt ctt gee get ctt 51 
Met Val Asn Phe Thr Thr Leu Leu Pro Val Leu Ala Ala Leu 
15 10 

att gga get gee aat gee cac act cgt gtc tac gga etc tec gtc aac 99 
lie Gly Ala Ala Asn Ala His Thr Arg Val Tyr Gly Leu Ser Val Asn 
15 20 25 30 

gat gtc aca tec tec ggc acc tec aat gac aag gee gtc get tct tec 147 
Asp Val Thr Ser Ser Gly Thr Ser Asn Asp Lys Ala Val Ala Ser Ser 
35 40 45 

agt att gcg gec gtg gac cct gtg acc age tec gtc gta gec tct gtt 195 
Ser lie Ala Ala Val Asp Pro Val Thr Ser Ser Val Val Ala Ser Val 
50 55 60 

cag gtc cct aac ttc act gee act gac gtc ccc act ttt act gec acc 243 
Gin Val Pro Asn Phe Thr Ala Thr Asp Val Pro Thr Phe Thr Ala Thr 
65 70 75 

gac ate cct act ttc act get act gat gtt cct ate ttc acc aag aag 291 
Asp lie Pro Thr Phe Thr Ala Thr Asp Val Pro lie Phe Thr Lys Lys 
80 85 90 

ccc caa cag ccc tea act tta ttg acc cgc acc cgt acc cat gec tct 339 
Pro Gin Gin Pro Ser Thr Leu Leu Thr Arg Thr Arg Thr His Ala Ser 
95 100 105 110 

gtt tea ttc gtc get aag ccc tec get ttt att ccc aag cct tec gcg 387 
Val Ser Phe Val Ala Lys Pro Ser Ala Phe lie Pro Lys Pro Ser Ala 
115 120 125 

age aca ate ccg tea aag ccc aag act ccc gaa gag gtt aat aag tgc 435 
Ser Thr He Pro Ser Lys Pro Lys Thr Pro Glu Glu Val Asn Lys Cys 
130 135 140 

ctt gac get gtc aac gee tgt att aca cag gee cag age tec att gga 483 
Leu Asp Ala Val Asn Ala Cys He Thr Gin Ala Gin Ser Ser He Gly 



145 150 155 

gga gtt gtc aac ttt gag cct tgc gag age cag aga get ctt tgc tat 531 
Gly Val Val Asn Phe Glu Pro Cys Glu Ser Gin Arg Ala Leu Cys Tyr 
160 165 170 

taggaactgc aaagaatctg gggggatggt agcgaggfctg agaggtggag gageggagga 591 

gtaggggagg tgagatggag taagattaag cggccgca 629 

<210> 2 
<211> 174 
<212> PRT 

<213> Pseudoplectania nigrella 
<400> 2 

Met Val Asn Phe Thr Thr Leu Leu Pro Val Leu Ala Ala Leu lie Gly 
15 10 15 

Ala Ala Asn Ala His Thr Arg Val Tyr Gly Leu Ser Val Asn Asp Val 
20 25 30 

Thr Ser Ser Gly Thr Ser Asn Asp Lys Ala Val Ala Ser Ser Ser He 
35 40 45 

Ala Ala Val Asp Pro Val Thr Ser Ser Val Val Ala Ser Val Gin Val 
50 55 60 

Pro Asn Phe Thr Ala Thr Asp Val Pro Thr Phe Thr Ala Thr Asp He 

65 70 75 80 

Pro Thr Phe Thr Ala Thr Asp Val Pro He Phe Thr Lys Lys Pro Gin 
85 90 95 

Gin Pro Ser Thr Leu Leu Thr Arg Thr Arg Thr His Ala Ser Val Ser 
100 105 110 

Phe Val Ala Lys Pro Ser Ala Phe He Pro Lys Pro Ser Ala Ser Thr 
115 120 125 

He -Pro Ser Lys Pro Lys Thr Pro Glu Glu Val Asn Lys Cys Leu Asp 
130 135 140 

Ala Val Asn Ala Cys He Thr Gin Ala Gin Ser Ser He Gly Gly Val 
145 150 155 160 



Val Asn Phe Glu Pro Cys Glu Ser Gin Arg Ala Leu Cys Tyr 
165 170 



<210> 3 

<211> 31 

<212> DNA 

<213> Pseudoplectania nigrella 



<220> 

<221> misc_feature Primer NP887U1 
<222> (1)..(31) 

<400> 3 

gacatcgttg acggagagtc cgtagacacg a 



<210> 4 

<211> 34 

<212> DNA 

<213> Pseudoplectania nigrella 



<220> 

<221> misc_feature Primer NP887D1 
<222> (1) . • <34> 

<400> 4 

acatcctccg gcacctccaa tgacaaggcc gtcg 



<210> S 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer PNA2I 



<220> 

<221> misc_feature Primer PNA2I 
<222> (1) . . (21) 

<400> S 

gtttccaact caatttacct c 



<210> 6 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer NP887Daul 



<220> 

<221> misc_feature Primer NP887Daul 
<222> (1) . . (32) 

<400> 6 

ccaaagcttt tcatcctccg gcacctccaa tg 



<210> 7 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer N887Dau2 



<220> 

<221> misc_feature Primer NP887Dau2 
<222> (1) . . (32) 

<400> 7 

gcgaagctta atcttactcc atctcacctc cc 32 



<210> 8 

<211> 573 

<212> DNA 

<213> Pseudoplectania nigrella 



<220> 

<221> CDS 

<222> (1) . . (570) 

<223> Positions 1-57 Candida lipase signal peptide, positions 58-147 
Candida lipase sequence, positions 148-570 P. nigrella CBM 
polypeptide . 

<400> 8 

atg aag eta etc tct ctg acc ggt gtg get ggt gtg ctt gcg act tgc 48 
Met Lys Leu Leu Ser Leu Thr Gly Val Ala Gly Val Leu Ala Thr Cys 
15 10 15 

gtt gca gec act cct ttg gtg aag tgc gca act agt ggc cat tac ggc 96 
Val Ala Ala Thr Pro Leu Val Lys Cys Ala Thr Ser Gly His Tyr Gly 
20 25 30 

etc gcg agg ccg cct egg ccc caa cga att ctt gga ata tta age ttt 144 
Leu Ala Arg Pro Pro Arg Pro Gin Arg lie Leu Gly lie Leu Ser Phe 
35 40 45 

tea tec tec ggc acc tec aat gac aag gec gtc get tct tec agt att 192 
Ser Ser Ser Gly Thr Ser Asn Asp Lys Ala Val Ala Ser Ser Ser He 
50 55 60 

gcg gec gtg gac cct gtg acc age tec gtc gta gec tct gtt cag gtc 240 
Ala -Ala Val Asp Pro Val Thr Ser Ser Val Val Ala Ser Val Gin Val 
65 70 75 80 

cct -aac ttc act gec act gac gtc ccc act ttt act gee acc gac ate 288 
Pro Asn Phe Thr Ala Thr Asp Val Pro Thr Phe Thr Ala Thr Asp He 
85 90 95 

cct act ttc act get act gat gtt cct ate ttc acc aag aag ccc caa 336 
Pro Thr Phe Thr Ala Thr Asp Val Pro He Phe Thr Lys Lys Pro Gin 
100 105 110 



cag ccc tea act tta ttg acc cgc acc cgt acc cat gec tct gtt tea 384 
Gin Pro Ser Thr Leu Leu Thr Arg Thr Arg Thr His Ala Ser Val Ser 
115 120 125 

ttc gtc get aag ccc tec get ttt att ccc aag cct tec gcg age aca 432 
Phe Val Ala Lys Pro Ser Ala Phe lie Pro Lys Pro Ser Ala Ser Thr 
130 135 140 

ate ccg tea aag ccc aag act ccc gaa gag gtt aat aag tgc ctt gac 4 80 

lie Pro Ser Lys Pro Lys Thr Pro Glu Glu Val Asn Lys Cys Leu Asp 
145 150 155 160 

get gtc aac gee tgt att aca cag gee cag age tec att gga gga gtt 528 
Ala Val Asn Ala Cys He Thr Gin Ala Gin Ser Ser He Gly Gly Val 
165 170 175 

gtc aac ttt gag cct tgc gag age cag aga get ctt tgc tat tag 573 
Val Asn Phe Glu Pro Cys Glu Ser Gin Arg Ala Leu Cys Tyr 
180 185 190 



<210> 9 

<211> 190 

<212> PRT 

<213> Pseudoplectania nigrella 

<400> 9 

Met Lys Leu Leu Ser Leu Thr Gly Val Ala Gly Val Leu Ala Thr Cys 
1 5 10 15 



Val Ala Ala Thr Pro Leu Val Lys Cys Ala Thr Ser Gly His Tyr Gly 
20 25 30 



Leu Ala Arg Pro Pro Arg Pro Gin Arg He Leu Gly He Leu Ser Phe 
35 40 45 



Ser Ser Ser Gly Thr Ser Asn Asp Lys Ala Val Ala Ser Ser Ser He 
50 55 60 



Ala Ala Val Asp Pro Val Thr Ser Ser Val Val Ala Ser Val Gin Val 
65 70 75 80 



Pro Asn Phe Thr Ala Thr Asp Val Pro Thr Phe Thr Ala Thr Asp He 
85 90 95 



Pro Thr Phe Thr Ala Thr Asp Val Pro He Phe Thr Lys Lys Pro Gin 
100 105 110 



Gin Pro Ser Thr Leu Leu Thr Arg Thr Arg Thr His Ala Ser Val Ser 
115 120 125 



Phe Val Ala Lys Pro Ser Ala Phe He Pro Lys Pro Ser Ala Ser Thr 



130 135 140 



lie Pro Ser Lys Pro Lys Thr Pro Glu Glu Val Asn Lys Cys Leu Asp 
145 150 155 160 



Ala Val Asn Ala Cys lie Thr Gin Ala Gin Ser Ser lie Gly Gly Val 
165 170 175 



Val Asn Phe Glu Pro Cys Glu Ser Gin Arg Ala Leu Cys Tyr 
180 185 190 



